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TITLE OH THE INVENTION 
I 

A PROCESSOR FOR EXECUTING INSTRUCTIONS IN UNITS THAT 
ARE UNRELATED TO THE UNITS IN WHICH INSTRUCTIONS ARE READ, 
AMD A COMPILER, AN OPTIMIZATION APPARATUS, AN ASSEMBLER, A 
LINKER, A DEBUGGER AND A DISASSEMBLER FOR SUCH PROCESSOR 

) 
1 

Thjls application is based on an application No, H10- 
118326 f!iled in Japan, the content of which is hereby 
incorporated by reference. 

BACKGROUND OF THE INVENTION 
1 ♦ Field of the Invention 



The present invention relates to a processor for 
executing instructions in units that are unrelated to the 
units inf which instructions are read, and a compiler, an 
optimization apparatus, an assembler, a linker, a debugger 
and a disassembler for such processor. 

! 
I 

2, Description of the Prior Art 

Processors conventionally read and execute 
instructions stored in memory according to a program 
counter. , Fig. 1 is a block diagram showing the basic 
construction of an example processor. 

The instruction memory 4301 stores four 8-bit 
instructions as one instruction packet. 

The! program counter 4300 indicates the address of an 
instruction packet in the instruction memory 4301. 



The instruction reading unit 4302 reads the 
instruction packet indicated by the program counter 4300 
from th€^ instruction memory 4301. 

i 

Thfe instruction executing unit 4303 executes all four 
instructions included in the read instruction packet in one 
cycle. i 

inj this way, a conventional processor can read an 
instruction packet that is indicated by the program counter 
and can execute four instructions in the instruction 

packet, i 

i 

Thf above processor has to execute all of the 
instructions in the read instruction packet in one cycle. 
Accordingly, when one or more instructions in an 
instruction packet cannot be executed due to problems with 
computer j system resources such as memory or I/O, none of 
the instructions in the instruction packet can be executed 
until such problems are resolved, This slows program 
execution. 

SUMMARY OF THE INVENTION 

In view of the stated problems, it is a primary 
object of the present invention to provide a processor that 
executes j instructions in units that are unrelated to the 
units in jwhich instructions are read from a program and a 
program c^^^iop^ent environment for generating suitable 
programs 

i 

This primary object is achieved by a processor for 



reading jinstructions from a memory according to a program 
counter/ the memory storing instructions in one-byte units, 
and for (executing the road instructions, the program 
counter ^including a first program counter and a second 
program counter, the first program counter indicating a 
storage position o£ a processing packet in the memory, the 
processing packet toeing composed of an integer number of 
the one-joyte units, the second program counter indicating a 
position of processing target instruction in the processing 
packet, the processing target instruction being an 
operation to be executed by the processor. 

Witph the stated construction, the first program 
counter indicates a storage position in the memory of a 
processing packet whose size is an integer number of bytes. 
Reads from the memory are performed based on this first 
program Y ounter f The second program counter can indicate 
any position of a processing target instruction included in 

the processing packet read from the memory. As a result/ 

i 

the instruction (s) to be executed can be freely set 
regardless of the amount of data read in one read 
operation* This means that instructions whose word length 
is not art integer number of bytes can be executed even when 
read operations from the memory to the processor are 
performed in units of an integer number of bytes . 

Here, the processor may include a first program 
counter qpdating unit and a second program counter updating 
unit, thej second program counter updating unit incrementing 



a value of the second program counter in accordance with an 
amount ojf instructions that were executed in a preceding 
cycle anki sending any carry generated in an incrementing to 
the first program counter updating unit, and the first 
program pounter updating unit adding the carry received 
from the' second program counter updating unit to the value 
of the first program counter. 

With the stated construction, the value of the 
program counter is incremented by the amount of 
instructions that have just been executed, so that the 
program fcounter can be updated to indicate the first 
position; of the instructions to be executed in the next 
cycle - ; 

Here, the processor may further include: a program 
counter relative value extracting unit for extracting, when 
an instruction being executed includes a program counter 
relative 1 value that is based on an address of a first 
instruction executed in a present cycle f the program 
counter relative value; and a calculating unit for adding 
the program counter relative value to the value of the 
first program counter and the value of the second program 
counter, and setting an addition result as the value of the 
first program counter and the value of the second program 
counter . ; 

Whel* the processor executes a branch instruction, the 
value of ;the program counter is added to a program counter 
relative rvalue that is a difference in addresses between 



the present branch instruction and the branch destination 
instruction. The result of this addition is then set as 
the new value of the program counter to have the program 
counter ; indicate the branch destination instruction. 

Hejre, the calculating unit may include a first 
calculating unit and a second calculating unit, the second 
calculating unit adding the value of the second program 
counter and lower bits of the program counter relative 
value, setting a result of an addition as the value of the 
second program counter, and sending any carry generated in 
the addition to the first calculating unit, and the first 
calculating unit adding the value of the first program 
counter, j upper bits of the program counter relative value, 
and any parry received from the second calculating unit, 
and settjLng a result of an addition as the value of the 
first program counter. 

When the processor executes a branch instruction and 

the program counter and a program counter relative value 

I 

are addetf, a carry generated when calculating the lower 
bits is properly considered when calculating the upper 
bits. In this way, addresses can be calculated with proper 
continuity between the calculation of the lower bits and 
the calculation of the upper bits. 

Herje, the calculating unit may include a first 
calculating unit and a second calculating unit, the second 
calculating unit adding the value of the second program 
counter ^nd lower bits of the program counter relative 



value without generating a carry, and setting a result of 
an addition as the value -of the second program counter, the 
first calculating unit adding the value of the first 
program 'counter and upper bits of the program counter 
relative; value, and setting a result of an addition as the 
value of the first program counter. 

When the processor executes a branch instruction, 
calculation of the lower bits of the value of the program 
counter ^nd the program counter relative value by the 
second calculating unit does not generate a carry to the 

calculation of the upper bits of the value of the program 

l 

counter &nd the program counter relative value by the first 
calculating unit* As a result, the calculations of the 
first and second calculators can be performed independently 
of one another, so that a simplified hardware construction 
can be used. 

Here, the calculating unit may add the value of the 
first program counter and upper bits of the program counter 

relative 5 value, sets a result of an addition as the value 

i 

of the first program counter, and sets lower bits of the 
program counter relative value as the value of the second 
program counter. 

When the processor executes a branch instruction, no 

calculation using the value of the second program counter 

S 

and the lower bits of the program counter relative value is 
required,, so that the processor can execute branch 
instructions at a higher speed. 

i 

6 



Hdre, the calculating unit may add the program 
counter 'relative value and a value whose upper bits are the 
value of the first program counter and lower bits are the 
value of the second program counter, and sets upper bits of 
a result of an addition as the value of the first program 
counter ;and lower bits of the result as the second program 
counter .> 

^ 

Whpn the processor executes a branch instruction, the 
calculation using the value of the program counter and the 
program counter relative value can be performed by a 
standard calculator* This means the hardware construction 
of the processor can be simplified. 

Here, the processor may further include: a program 
counter relative value extracting unit for extracting, when 
an executed instruction includes a program counter relative 

value that is based on an address of the executed 

t 

instruction, the program counter relative value; a program 

i 

counter amending unit for amending the value of the first 
program qounter and the value of the second program counter 
to indicate an address of the executed instruction; and a 
calculating unit for adding the program counter relative 
value, the value of the first program counter, and the 
value of the second program counter, and setting a result 
of an addition as the value of the first program counter 
and the value of the second program counter. 

The* program counter relative value is the difference 
in addresjses between a branch instruction and the branch 



destination instruction, so that it will not be necessary 
to chancfe the program counter relative value even when 
there id a change in the boundaries marking which 
instructions in the program will be executed in parallel. 

He^e, the processor may further include: a program 
counter relative value calculating instruction decoding 
unit for decoding a program counter relative value 
calculating instruction that performs an addition using a 
program pounter relative value and one of (a) a value of 
the progjram counter stored in a register, and (b) the value 
of the first program counter and the value of the second 
program fcounter; a calculating unit for performing the 

addition! indicated by the program counter relative value 

i 

calculating instruction to generate an addition result; and 
a program counter value updating unit for storing the 
addition I result in one of (a) the register, and (b) the 
first program counter and the second program counter. 

With the stated construction, it is possible to use 
an instruction that indicates a calculation using the value 
of the program counter and a program counter relative value 
in place -of an instruction that stores the absolute address 

of a function into a register* A program counter relative 

i 

value ha^ a shorter bit width that the absolute address of 
an instruction, so that the overall code size can be 
reduced. \ When using PIG codes where the addresses of 

instructions in memory are only determined when the program 

i 

is executed, absolute addresses cannot be used, so that 

i 

i 
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calculation instructions that use the program counter and a 
program ^counter relative value are essential. 

Here, the first program counter may indicate a memory 
address/, the memory address being a storage position in the 
memory of a processing packet that is given by bit shifting 
the valuje in the first program counter by log 2 n bits in a 
leftward direction, n being a length of a processing packet 
in bytes. 

With the stated construction, while separate 
addressed are assigned to each one-byte storage packet in 
the memory, the value of the first program counter 
corresponds with the address of a processing packet in the 
memory, j As a result, the processor can easily specify a 
processing packet in the memory, 

Hejfe, the processor may further include: an 
instruction buffer for temporarily storing instructions; 

i 

and an instruction reading unit for transferring 

i 

instructions with a minimum transfer sise of one one-byte 
unit from the memory to the instruction buffer, in 
accordance with available space in the instruction buffer 
but regardless of a size of a processing packet . 

With the stated construction, the amount of data read 
by the processor from the memory in one read operation can 
k e freelij set, so that the construction in the processor 
for reading instructions can be made highly flexible. 

Thej stated primary object can also be achieved by an 
instruction sequence optimizing apparatus, for generating 



optimized code from an instruction sequence, including: an 
address Assigning unit for estimating a size of each 
instruction in the instruction sequence and assigning an 
address ! to each instruction, upper bits of each address 
indicating a memory address at which a processing packet is 
stored and lower bits of each address indicating a 
processing target instruction in the processing packet; a 
label detecting unit (1) for detecting a label, which 
should bp resolved by an address of a specified 
instruction, from the instruction sequence, and obtaining 
the addrtess of the specified instruction, and (2) for 
detecting a label, which should be resolved by a difference 
in addresses of two specified instructions, from the 
instruction sequence, and obtaining the addresses of the 
two specified instructions; a program counter relative 
value calculating unit for calculating, when a label which 
should b£ resolved by a difference in addresses of two 
specified instructions has been detected, a program counter 
relative value by subtracting an address of one o£ the two 
specified instructions from an address of another of the 
two specified instructions; a converting unit (1) for 
converting an instruction that has a label that should be 
resolved ;by an address of a specified instruction into an 

instruction with a size that is based on a size of the 

j 

address of the specified instruction, (2) for converting an 
instruction that has a label that should be resolved by a 
differenqe in addresses of two specified instructions into 
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an instruction with a size that is based on a size of the 
program counter relative value calculated from the 
addresses of the two specified instructions; and an 
optimized code generating unit for generating optimized 
code by 'converting addresses of instructions in accordance 
with the; sizes of instructions after conversion by the 
converting unit. 

The above construction achieves an optimization 
apparatus for generating programs for a processor that 
executes branch instructions . 

i 

Hejre, the program counter relative value calculating 
unit may: include a lower bit subtracting unit and an upper 
bit subtracting unit, the lower bit subtracting unit 
subtracting lower bits of the address of the one of the two 
specified instructions from lower bits of the address of 
the other of the two specified instructions, for setting a 

result of a subtraction as lower bits of the program 

j 

counter Relative value , and sending any carry generated in 
the subtraction to the upper bit subtracting unit, and the 
upper bit subtracting unit subtracting upper bits of the 
address of one of the two specified instructions and any 
carry received from the lower bit subtracting unit from 
upper bits of the address of the other of the two specified 
instructions, and for setting a result of a subtraction as 
upper bitis of the program counLer relative value - 

Thel above construction achieves an optimization 

i 

apparatus for generating programs for a processor which, 

i 



when executing a branch instruction, calculates the address 
of a branch destination instruction using a carry method - 

Here, the program counter relative value calculating 
unit may include a lower bit subtracting unit and an upper 
bit subtracting unit, the lower bit subtracting unit 
subtracting lower bits of the address of one of the two 

specified instructions from lower bits of the address of 

1 

the othet of the two specified instructions without 
generating a carry and setting a result of a subtraction as 
lower biits of the program counter relative value, and the 
upper bit subtracting unit subtracting upper bits of the 
address of one of the two specified instructions from upper 

i 

bits of jthe address of the other of the two specified 
instructions, and for setting a result of a subtraction as 
upper bx^s of the program counter relative value* 

Th^ above construction achieves an optimisation 
apparatus for generating programs for a processor which, 

! 

when executing a branch instruction/ calculates the address 

i 

of a branch destination instruction without using a carry. 

Here, the program counter relative value calculating 
unit may: subtract upper bits of an address of one of the 
two specified instructions from upper bits of an address of 
the other of the two specified instructions, set a result 
of a subtraction as upper bits of the program counter 
relative lvalue, and set lower bits of the other of the two 

i 

specified instructions as lower bits of the program counter 
relative lvalue. 
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I 

I 
i 

Thle above construction achieves an optimization 

i 

apparatus for generating programs for a processor which, 
when executing a branch instruction, calculates the address 
of a bra!nch destination instruction using an absolute 
5 value . . 

The stated primary object can also be achieved by an 
assembler that generates relocatable code from an 
instruction sequence, each address of an instruction in the 
instruction sequence having upper bits that indicate a 
10 memory address at which a processing packet is stored and 
! ? lower bits that indicate a position of processing target 

.ri: . 

U instruction that is included in the processing packet, the 

J assemble*: including: a label detecting unit for detecting a 

j| label in! the instruction sequence that should be resolved 

Fi6 by a difference in addresses between two specified 

□ instructions, and obtaining the addresses of the two 

jlj specified instructions; a program counter relative value 

l % calculating unit for calculating a program counter relative 

iO value by J subtracting an address of one of the two specified 

20 instructions from an address of another of the two 

specified instructions; and a replacing unit for replacing 

i 

the labe^ with the program counter relative value 
calculated by the program counter relative value 

i 

calculating unit. 

s 

25 Thei above construction achieves an assembler for 

j 

generating programs for a processor that executes branch 
instructions . 

: 13 

t 



Hejre, the program counter relative value calculating 
unit may include a lower bit subtracting unit and an upper 

i 

bit subtracting unit, the lower bit subtracting unit 

j 

subtracting lower bits of the address of the one of the two 
5 specified instructions from lower bits of the address of 

the othejr of the two specified instructions, for setting a 
result df a subtraction as lower bits of the program 
counter relative value, and sending any carry generated in 
the subtraction to the upper bit subtracting unit, and the 
10 upper bijt subtracting unit subtracting upper bits of the 
B address bf one of the two specified instructions and any 
1 carry received from the lower bit subtracting unit from 
Z upper bijbs of the address of the other of the two specified 

3 instructions, and for setting a result of a subtraction as 

15 upper bits of the program counter relative value. 

2he above construction achieves an assembler for 
* generating programs for a processor which, when executing a 

I branch instruction, calculates the address of a branch 

destination instruction using a carry method* 
2 0 Here, the program counter relative value calculating 

unit may | include a lower bit subtracting unit and an upper 

i 

bit subtracting unit, the lower bit subtracting unit 
subtracting lower bits of the address of one of the two 
specified instructions from lower bits of the address of 

2 

25 the otheij of the two specified instructions without 

generating a carry and setting a result of a subtraction as 

i 

lower bit^s of the program counter relative value, and the 

! 

\ H 

i 
> 

I 

l 



upper bit subtracting unit subtracting upper bits of the 
address !of one of the two specified instructions from upper 
bits of ithe address of the other of the two specified 
instructions, and for setting a result of a subtraction as 
upper bits of the program counter relative value. 

The above construction achieves an assembler for 
generating programs for a processor which, when executing a 
branch instruction/ calculates the address of a branch 
destination instruction without using a carry, 

Hefe, the program counter relative value calculating 
unit may 1 ; subtract upper bits of an address of one of the 

two specified instructions from upper bits of an address of 

i 

the othet of the two specified instructions, set a result 
of a subtraction as upper bits of the program counter 

relative- value, and set lower bits of the other of the two 

■> 

specified instructions as lower bits of the program counter 

i 

relative \ value . 

Th^ above construction achieves an optimisation 

i 

apparatus for generating programs for a processor which, 
when executing a branch instruction, calculates the address 
of a branch destination instruction using an absolute 
value . t 

The stated primary object can also be achieved by a 
linker ttjat generates object code by combining relocatable 

code, each address of an instruction in the relocatable 

f 

code havijng upper bits that indicate a memory address at 
which a processing packet is stored and lower bits that 

i 
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indicate! a position of processing target instruction that 
is included in the processing packet, the linker including: 
a relocation information detecting unit for detecting a 
label iri the relocatable code that should be resolved by a 
5 difference in addresses between two specified instructions, 

S 
( 

and obtaining the addresses of the two specified 
instructions; a program counter relative value calculating 
unit for! calculating a program counter relative value by 
subtracting an address of one of the two specified 
10 instructions from an address of another of the two 

J specified instructions; and a replacing unit for replacing 

i i 

t the labe|L with the program counter relative value 

] calculated by the program counter relative value 

) calculating unit. 

15 Th<? above construction achieves a linker for 

generating programs for a processor that executes branch 
instructions, 

: I 

;. S 

Here, the program counter relative value calculating 
unit laayj include a lower bit subtracting unit and an upper 

20 bit subtracting unit, the lower bit subtracting unit 

subtracting lower bits of the address of the one of the two 
specified instructions from lower bits of the address of 
the other of the two specified instructions, for setting a 
result of a subtraction as lower bits of the program 

25 counter Relative value, and sending any carry generated in 
the subtraction to the upper bit subtracting unit, and the 
upper bitj subtracting unit subtracting upper bits of the 

16 

I 

| 



i 

address jof one of the two specified instructxons and any 
carry received from the lower bit subtracting unit from 
upper bits of the address of the other of the two specified 
instructions, and for setting a result of a subtraction as 

i 

5 upper bo^ts of the program counter relative value. 

Thb above construction achieves a linker for 
generating programs for a processor which, when executing a 
branch instruction, calculates the address of a branch 
destination instruction using a carry method. 
10 Here, the program counter relative value calculating 

w unit may! include a lower bit subtracting unit and an upper 

W bit subtracting unit, the lower bit subtracting unit 

13 | 

U subtracting lower bits of the address of one of the two 

m specifieji instructions from lower bits of the address of 

15 the other of the two specified instructions without 
□ generating a carry and setting a result of a subtraction as 

ill lower bits of the program counter relative value, and the 

% upper bit subtracting unit subtracting upper bits of the 

! P address fc>£ one o£ the two specified instructions from upper 

20 bits of the address of the other of the two specified 

instructions, and for setting a result of a subtraction as 
upper bi-jis of the program counter relative value. 

Th^ above construction achieves a linker for 
generating programs for a processor which, when executing a 

25 branch instruction, calculates the address of a branch 

? 

destination instruction without using a carry. 

Here, the program counter relative value calculating 
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unit mays subtract upper bits of an address of one of the 
two specified instructions from upper bits of an address of 
the other of the two specified instructions, set a result 
of a subtraction as upper bits of the program counter 
relative value, and set lower bits of the other of the two 

specified instructions as lower bits of the program counter 

i 

relative value. 

Th4 above construction achieves a linker for 
generating programs for a processor which, when executing a 
branch instruction, calculates the address of a branch 
j i- - destination instruction using an absolute value. 
W Th4 stated primary object can also be achieved by a 

S disassembler that receives an indication of an address of 

S an instruction in object code and outputs an assembler name 

! Pl5 of the instruction at the indicated address, each address 
□ of an instruction in the object code having upper bits that 

if] indicate, a memory address at which a processing packet is 

j ~ stored and lower bits that indicate a position of 

;J3 procossipg target instruction that is included in the 

20 processing packet, the disassembler including: a program 

counter Relative value extracting unit for extracting, when 
the indicated instruction includes a program counter 
relatives value, the program counter relative value from the 
indicated instruction; a label addressing calculating unit 
for adding an address of the indicated instruction to the 
extracted program counter relative value and setting an 
addition j result as a label address; a storing unit for 
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storing 



a label name corresponding to each label address; 
and a searching unit for searching the storing unit for a 
label name that corresponds to the calculated label address 
and outputting Lhe corresponding label name. 

The stated construction can disassemble a program 
that includes a branch instruction. When the disassembled 

instruction is a branch instruction, the address of the 

i 

branch destination instruction can be calculated from the 
program counter relative value. This address is then used 
to search the label table and so obtain the label narae. As 
a result> the branch destination can be displayed to the 
user in the readily understandable form of a label name, 
even wheh program counter relative values are used in 
branch instructions. 

He£e, the label address calculating unit may include 

r 

a lower bit calculating unit and an upper bit calculating 
unit, the lower bit calculating unit for adding lower bits 
of the address of the indicated instruction and lower bits 
of the program counter relative value, setting a result of 
an addition as lower bits of a label address, and sending 
any carry generated by the addition to the upper bit 
calculating unit, and the upper bit calculating unit adding 
upper bits of the address of the indicated instruction, 
upper bits of the program counter relative value, and any 
carry received from the lower bit calculating unit, and 
setting a result of the an addition as upper bits of the 
label address. 
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Thfe above construction achieves a disassembler that 
can disassemble programs for a processor which, when 
executing a branch instruction, calculates an address of a 
branch destination instruction using a carry. 

Here, the label address calculating unit may include 
a lower bit calculating unit and an upper bit calculating 
unit, the lower bit calculating unit adding lower bits of 

the address of the indicated instruction and lower bits of 

i 

the program counter relative value without generating a 
carry, and setting a result of an addition as lower bits of 
a label ^ddress, and the upper bit calculating unit adding 

upper bits of the address of the indicated instruction and 

i 

upper bits of the program counter relative value, and 

i 

setting ^ result of an addition as upper bits of the label 
address. j 

The above construction achieves a disassembler that 
can disassemble programs for a processor which, when 
executing a branch instruction, calculates an address of a 
branch destination instruction without using a carry. 

Here, the label address calculating unit may add 
upper bits of the address of the indicated instruction and 
upper biips of the program counter relative value, set a 
result of an addition as upper bits of the label address, 
and set |ower bits of the program counter relative value as 
lower biips of the label address. 

The; above construction achieves a disassembler that 
can disassemble programs for a processor which, when 

i 
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executing a branch instruction, calculates an address of a 

j 

branch destination instruction using an absolute value. 

This stated primary object can also be achieved by a 
debugged that receives an indication of an address of an 
instruction in object code and replaces the instruction at 
the indicated address with a replacement instruction, each 
address jof an instruction in the object code having upper 
bits tha^t indicate a memory address at which a processing 
packet is stored and lower bits that indicate a position of 
processing target instruction that is included in the 
processing packet, the debugger including: a processing 
packet reading unit for reading a processing packet that is 
indicate!* by upper bits of the indicated address from the 

i 

memory and writing the processing packet into an 
instruction buffer; an instruction writing unit for writing 
the replacement instruction into the processing packet in 
the instruction buffer over an instruction that is 

i 

indicated by the lower bits of the indicated address; and a 
processing packet writing unit for writing the processing 
packet x^i the instruction buffer back into the memory after 
the replacement instruction has been written. 

Thes above construction reads instructions in units of 

S 

processing packets from a memory that stores instructions 
in one-b^te storage packets, rewrites instructions in an 
instruction buffer, and writes instructions back into the 
memory irj. units of processing packets. This achieves a 

debugger :that can debug instructions whose length is not an 

i 
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integer jnumber of bytes. 

The stated primary object can also be achieved by a 
compiler that generates an instruction sequence from source 
code, the compiler generating a program counter relative 
value calculating instruction that is executed by a 
processor, the program counter relative value calculating 
instruction being an instruction that performs a 
calculation using a first value and a program counter 
relative } value and uses a result of the calculation to 
update the first value, the first value being one of {a) a 
value of | a program counter stored in a register, and (b) 
the valup stored in a program counter of the processor, 
wherein upper bits of the first value indicate a memory 
address jat which a processing packet is stored, and lower 
bits of the first value of the program counter indicate a 
processing target instruction that is included in the 
processing packet. 

The above construction achieves a compiler that 
generates programs for a processor that executes program 
counter relative value calculating instructions. 

Here, the processor may include a lower bit 
calculating unit and an upper bit calculating unit, the 
program counter relative value calculating instruction 
having the lower bit calculating unit perform a lower bit 
calculation and the upper bit calculating unit perform an 
upper bitj calculation, the lower bit calculation being an 
addition ^using lower bits of the first value and lower bits 

i 
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of the vialue of the program counter relative value , where a 
result cyf the lower bit calculation is set as the lower 
bits of the first value and any generated carry is sent to 
the upper bit calculating unit, and the upper bit 
calculation being an addition using upper bits of the first 
value, ujpper bits of the value of the program counter 
relative! value and any carry received from the lower bit 
calculating unit, where a result of the upper bit 
calculation is set as the upper bits of th£ first value- 
Th^ above construction achieves a compiler that 

generates a program for a processor which, when executing a 

i 

program counter relative value calculating instruction, 
performs| a calculation using a value of the program counter 
and the program counter relative value according to a carry 

method, » 

i 

Here, the processor may include a lower bit 
calculating unit and an upper bit calculating unit, the 
program cpounter relative value calculating instruction 
having -tike lower bit calculating unit perforin a lower bit 
calculation and the upper bit calculating unit perform an 
upper bit calculation, the lower bit calculation being an 
addition using lower bits of the first value and lower bits 
of the value of the program counter relative value that 
does not I generate a carry, where a result of the lower bit 
calculation is set as the lower bits of the first value, 
and the upper bit calculation being a calculation using 
upper bits of the first value and upper bits of the value 
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of the fjrogram counter relative value, where a result of 
the uppejr bit calculation is set as the upper bits of the 
first value. 

The above construction achieves a compiler that 
generates a program for a processor which, when executing a 
program (Counter relative value calculating instruction, 
performs* a calculation using a value of the program counter 
and the [program counter relative value without generating a 
carry. 

Hefe, the processor may includes an upper bit 
calculating unit, the program counter relative value 
calculating instruction having the upper bit calculating 
unit perform an upper bit calculation and setting lower 

bits of jthe program counter relative value as lower bits of 

j 

the first value, and the upper bit calculation being an 
addition| using upper bits of the first value and upper bits 
of the value of the program counter relative value, where a 
result of the upper bit calculation is set as the upper 
bits of |zhe first value. 

Th$ above construction achieves a compiler that 
generates a program for a processor which, when executing a 
program counter relative value calculating instruction, 
performs a calculation using a value of the program counter 
and the program counter relative value according to an 
absolute ; value calculating method . 

i 



BRIEF DESCRIPTION OF THE DRAWINGS 

i 

Thbse and other objects, advantages and features of 
the invention will become apparent from the following 
description thereof taken in conjunction with the 

accompanying drawings which illustrate a specific 

i 

embodiment of the invention* In the drawings: 

Fig. 1 is a block diagram showing the construction of 

a conventional processor; 

Fig. 2A shows the format of one instruction executed 

by the processor of the first embodiment of the present 

invention; 

Fig. 2B shows the format of another instruction 
executed 1 by the processor of the first embodiment of the 
present pivention; 

Fig. 2C shows the format of another instruction 
executed! by the processor of the first embodiment of the 
present invention ; 

Fig, 2D shows the format of another instruction 
executed | by the processor of the first embodiment of the 
present invention; 

Fig. 2E shows the format of another instruction 

i 

executed 4 by the processor of the first embodiment of the 
present invention; 

Fig. 3A shows an instruction packet that is the unit 

i 

used for j storing and reading instructions in this first 
embodiment; 

i 

Figi. 3B shows the read order of instructions; 

i 

25 



Fig. 3C shows the execution order of instructions; 

i 

Fi&. 4 shows an example of the methods used by a 
conventional processor to store and read instructions that 
are not byte-aligned; 
5 F±|;. 5 shows the procedure by which the object code 

to be executed by the processor is generated by a compiler, 
optimization apparatus, assembler, and linker; 

Ficj. 6 is a block diagram showing the details of the 
processor 309 and the external memory; 

10 Fig. 7 is an increment table showing the rules used 

I 

□ to increment the in-packet address; 

ijj Fi^. 8A is an addition table showing the addition 

:J rules used when adding the lower 3 bits of the address of a 

K branch instruction to lower 3 bits of the PC relative 

JpL5 value; j 

Ficjr- 8B is a subtraction table showing the 
:f! subtraction rules used when subtracting the lower 3 bits of 

ill the PC relative value from the lower 3 bits of a branch 

,q destination address; 

20 Fig. 9 is a block diagram showing the components and 

input /output data of the optimization apparatus 303; 

Ficjf. 10 is a flowchart showing the operation 
procedure of the optimization apparatus; 

Fig, 11 shows part of the optimization processing 

1 

25 code 903 ' generated by the code optimisation apparatus 902; 
Fig. 12 shows the address assigned codes 916 
generated from the optimisation processing code 903 shown 



in Fig. 11; 

Fig, 13 shows the label information 906 generated 
from the address assigned codes 916 shown in Fig. 12; 

Fi£. 14 shows the optimized code 304 generated from 
5 the address assigned codes 916 shown in Fig, 12; 

i 

Fig. 15 is a block diagram that shows the 

s 

construqtion of the assembler 305 shown in Fig. 5 and the 
input /output data related to the assembler 305; 

Fig. 16 is a flowchart showing the operation of the 
10 assembler; 

* Fijj. 17 shows the machine language codes 803 that are 

J generated from the optimized code 304 shown in Fig. 14; 

a Fig* 18 shows the label information that is generated 

i from the; machine language codes shown in Fig- 17; 

s 15 Fig. 19 shows the relocatable codes that are 

1 generate^ from the machine language codes 803 shown in Fig. 

1 17; i 

i Fi^, 20 is a block diagram showing the construction 

3 of the linker 307 and the I/O (input /output) data of the 

20 linker 3<?7; 

Fig. 21 is a flowchart showing the operation of the 
linker 307; 

Figt- 22 shows the relocatable codes; 

! 
I 

Fiqj. 23 shows the state when the relocatable codes 

j 

25 8 1 4 shown in Fig. 19 have been combined with the 
relocatable code shown in Fig* 22; 

Fig;. 24 shows the resulting coflabined codes 703; 
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Fi£. 25 shows the label information that is generated 

E 

from the combined codes 703 shown in Fig. 24; 

j 

Fig, 26 shows the object codes generated from the 

i 

coiobinedj codes 703 shown in Fig. 24; 

Fi<p. 27 shows the object code generated by the second 
embodiment of the present invention; 

Fig. 28A shows the construction of an instruction 
packet iji the third embodiment; 

Fig. 28B shows the types of instructions used in the 

i 

third embodiment; 

Fi&, 28Q shows the relation between in-packet 

addressed and the instruction units in a packet; 

j 

Fi<jT« 2 9A is an addition table showing the addition 
rules fo|r adding the lower 3 bits of the address of the 
branch instruction and the lower 3 bits of the PC relative 
value in! the calculation method of the fourth embodiment 
that doe£ not use a carry; 

Fi^. 29B is a subtraction table showing the 
subtraction rules for subtracting the lower 3 bits of the 
address <j>f the branch instruction from the lower 3 bits of 
the address of the branch destination instruction in the 
calculation method of the fourth embodiment that does not 
use a ca^ry; 

Figj. 30 shows the object code that is generated by 
the address calculation method of the fourth embodiment 
that does not use a carry; 

Fig r 31A is an addition table showing the addition 
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rules fop adding the lower 3 bits of the address of the 

i 

branch instruction and the lower 3 bits of the PC relative 
value in! the calculation method of the fifth embodiment 
that uses absolute values ; 
5 Fig. 31B is a subtraction table showing the 

subtraction rules for subtracting the lower 3 bits of the 
address of the branch instruction from the lower 3 bits of 
the addrfcss of the branch destination instruction in the 
calculation method of the fifth embodiment that uses 
10 absolute values; 
y. Ficr. 32 shows the object code that is generated by 

kj the above address calculation method of the fifth 

m ) 

u embodiment that uses absolute values; 

hi ! 

Fig, 33 shows the object code that has been generated 

: M5 using th$ linear calculation method of the sixth 

ii i 

O embodiment; 

m Fig. 34 shows the processor of the seventh 

! 2 embodiment; 

;Q Figj. 35A shows the operation that corresponds to a PC 

20 adding instruction which is shown in mnemonic form; 

Fig. 35B shows the operation that corresponds to a PC 
subtracting instruction which is shown in mnemonic form; 

Fig|. 36 shows the construction of the compiler of the 
eighth embodiment of the present invention; 

Figj. 37 is a flowchart showing the operation of the 
compiler;! 

Fig!, 38 shows source code which is written in C 
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languagej; 

Fiij. 39 shows the intermediate codes that have been 
generated from the source program shown in Fig. 38; 

Fig. 40 shows the assembler code that has been 
produced by converting the intermediate codes shown in Fig- 
39; 

i 

Fi£. 41 is a block diagram showing the construction 
of the debugger and disassembler of the present embodiment; 

Fig. 42 is a flowchart showing the operating 
procedure of a disassembler of the present invention; and 

Fig. 43 is a flowchart showing the operation of the 
debugger! of the present invention. 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 

i 

Thf following is a detailed description of several 

I 

embodiments of the present invention, with reference to the 
accompanying drawings* 

First Embodiment 
1 

This first embodiment relates to an optimization 

I 

apparatu^p, an assembler, and a linker that generate 
programs ! where read operations and execute operations have 
different units, and to a processor for executing such 
programs i 
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I 

Formats \>f the Instructions Executed by the Processor 

The following explains the formats of the 
instructions executed by the processor of this first 
embodimeht . These formats are shown in Figs. 2A - 2E. The 
5 instructions executed by the present processor are 

constructed so that 21 bits is set as one instruction unit. 
For the present processor/ there are both one-unit (i.e., 
21-bit) and two-unit {i.e./ 42-bit) instructions. 

The format information 101 is written as one bit and 
10 shows this length of each instruction. When the format 
O information 101 is "0", this shows that the unit including 

y this format information 101 forms one complete instruction, 
y; which isl to say, a 21-bit instruction* When the format 

^ information 101 is "1", this shows that the unit including 

Il5 this forjnat information 101 and the following unit together 
p form onej two-unit instruction, which is to say f a 42-bit 

m instruction . 

! i™ Th£ parallel execution boundary information 100 is 

;0 also written as one bit and shows whether a parallel 

20 executiop boundary exists between the instruction formed by 
the presjent unit and the following instruction. When the 

5 

parallelj execution boundary information 100 is "1", this 
shows that a parallel execution boundary exists between the 
instruction including this parallel execution boundary 
25 information 100 and the following instruction, so that 

these instructions will be executed in different cycles* 
When the! parallel execution boundary information 100 is 
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w 0", this shows that no parallel execution boundary exists 
between the instruction including this parallel execution 
boundary! information 100 and the following instruction, so 
that these instructions will be executed the same cycle- 

The remaining bits in each instruction are used to 
show an operation. This means that 19 bits can be used to 
indicate; the operation in a 21-bit instruction and that 4 0 
bits can; be used to indicate the operation in a 42-bit 
instruction. The fields marked w Opl'\ fT Op2 n / "OpS", and 
"Op4 u are used to store opcodes that indicate the type of 
operation to be performed. The field marked "Rs" is used 
to store! the register number of a register used as the 
source operand and the field marked "Rd" is used to store 
the register number of a register used as the destination 
operand. The fields marked ,f imm5" and "inun32" are 
respectiyely used to store 5-bit and 32-bit immediates that 
are used; in calculations. Finally, the fields marked 
"displ3" land t *disp32" are respectively used to store 13-bit 
and 32-bit displacements. 

Transfer instructions and arithmetic instructions 
that handle long (such as 32-bit) constants and branch 
instructions that use large displacements are defined as 

42-bit instructions. Most other instructions are defined 

i 

as 21-bit instructions. Of the two units used to compose a 
42-bit instruction, the latter unit is only used to store 
part of tjhe long constant or displacement, and so does not 
store the opcode of the instruction. 

t 
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Reading ftnd Execution of Instructions by the Processor 

Th4 following explains the operation of the present 
processor when reading and executing instructions. Note 
that the * processor of the present embodiment has a premise 
5 that static parallel scheduling is used. Fig* 3A shows an 
instruction packet that is the unit used for storing and 

i 

reading instructions. Each instruction packet is composed 
of three! instruction units (63 bits) and dummy data (1 
bit) , In each cycle, the processor reads instructions 
10 using this fixed 64-bit packet length. Packets of this 
12 size are; used because the 21-bit unit size of instruction 
y is not suited to reading from memory. Accordingly, a 
;~/ number of such instructions are read together with dummy 
U data to ijiake the total packet size equal to an integer 
; p number o^ bytes. In this example, since the number of 
q instruction units in each instruction packet is not a power 
|t of two, inhere is the following special effect. This effect 
ig overcome^ the problems that occur when positions of the 
\Q units inside instruction packets are expressed using 

20 binary. I In the following explanation, the three units in 
an instruction packet are called the first/ second and 
third un^ts in order starting from the unit with the lowest 

address value, 

j 

Figj« 3B shows the read order of instructions. As 
25 shown in jthe figure, one instruction packet is read in each 
cycle, | 

Fig!. 3C shows the execution order of instructions. In 

i 
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each cycle, instructions are executed as far as the next 
parallel- execution boundary. This means that the 
instructions are executed up to and including an 
instruction whose parallel execution boundary information 
100 is "il w - Instruction units that are read but not 
executed are accumulated in the instruction buffer, and are 
executed in a later cycle* 

As ! described above, the processor of the present 
embodiments reads instructions using packets of a fixed 
length, but only executes a suitable number of units in 
each cycle depending on parallelism of the instructions. 
The reason that the present processor can start the 

s 

executiopi of instructions in one cycle at any of the 
instruction units in an instruction packet is that an in- 
packet address specifies an instruction unit in an 
instruction packet. This is described in more detail 
later. j 

Fi^f. 4 shows an example of the methods used by a 
conventional processor to store and read instructions that 
are not byte-aligned. When 21-bit instructions that are 

i 

not byteyaligned are to be read in byte-units , three unused 
bits hav^ to be added to the end of each instruction to 
make the { instruction length 24-bits. This means that what 
are essentially 21-bit instructions are stored into and 
read froiji memory in 24-bit units* The length of three of 
such instructions is 72 bits, so that the storage of three 
instructions in a 64-bit packet in the present embodiment 

i 



reduces Overall program size. 

Note that while the present embodiment describes the 
packet construction when 21-bit instructions are used, the 
invention is not limited to this instruction length. It is 
equally possible to construct instruction packets of 
instructions of a different length and to read the 
instructions using such instruction packets. As one 
example, f when instructions are n-bits long, values of m and 
r may be> selected so as to give a maximum value of 
n*m* (n*m+r) subject to (n*xu+r5mod8=0 - One packet is then 
composed; of m instruction units (each being n bits long) 
and r-bit dummy data. By doing so, instruction packets can 
be composed of multiple-byte size using relatively little 
dummy dapa. 
) 

Method ffrr Expressing Instruction Addresses 

Th$ following explains the method used to express 
instruction addresses in the present embodiment. Here, an 
instruction address refers to the address used Lo specify 
the position of a unit and is expressed as 32 bits. 

The upper 29-bits of a 32-bit address are used to 

i 

specify an instruction packet and so are called the "packet 
address 11 ^ This packet address is expressed as a 29-bit 
hexadecimal figure in a format such as "29 r h01234567 ,f . A 
value produced by shifting the value of this packet address 
by 3-bit** to the left is the memory address at which the 

instruction packet is stored. 

\ 
t 
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Th£ lower 3-bits in a 32-bit address are used to 
specify |an instruction unit included in the instruction 
packet apid so are called the "in-packet address". This in- 
packet Address is expressed as a 3-bit binary value in a 
5 format such as "3 f b001 n . As examples, the in-packet 

address i^'bOOl" specifies the first unit in an instruction 
packet, fthe in-packet address "3^010" specifies the second 
unit, and the in-packet address "S'blOO" specifies the 
third uriit. However, the in-packet addresses are not 
10 limited !to these specific values. Other values may be used 
D provided that the instruction units in an instruction 

y packet ^re each specified using their own value, 

y Thfe indicating of addresses in this embodiment is 

II such thaft only 3 bits are assigned for eight-bytes of 

>pl5 instructions . This gives the same results as when a 
h conventional processor assigns a separate address to each 

jh byte, since the upper 29-bits of addresses assigned to 

V eight-b^tes of instructions will be the same. 

20 Method for Generating the Object Code Executed by the 
Processor 

! — 

The following explains the method for generating the 
object c|ode that is executed by the processor of the 
present ; embodiment. 
25 First, the terminology to be used in this explanation 

is defined. 

A l"PC relative value" is the difference between the 
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addresses of two instructions. 

A * label" is either an "instruction address-resolved 

label" or a "PC relative value-resolved label". Absolute 
address-resolved labels are replaced with absolute 
addresses of instructions during the processing that 
converts, a program into object code. An example of such a 
label is; the label "L2" in the transfer instruction "mov 
L2,rl" that transfers an instruction stored in memory to 
the register rl. PC relative value-resolved labels are 

i 

replaced! with PC relative values during the processing that 
converts \ a program into object code. An example of such a 
label is | the label "Ll" in the unconditional branch 

i 

instruction "bra LI" that performs an unconditional branch 
using the PC relative value, "Local labels" and "external 
labels" ^Iso exist as other types of label- When a label 
and the instruction including the label are included in the 
same module {a module being a subprogram composed of an 
instruction sequence achieving one processing function) , 
such label ±s called a local label, while when the label 
and instruction including the label are included in 
different modules, such label is called an external label. 

Ficp 5 shows the procedure by which the object code 
to be executed by the processor is generated by a compiler, 
optimization apparatus, assembler, and linker. An overview 
of the functions of these components is given below. 

Th^ compiler 301 analyzes the content of the source 
code 300 | that is written in a high-level language like C 
t 
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and outpjits assembler code 302* 

Th^ optimization apparatus 303 assigns temporary 
addressed to the asserobler code 302, links the instruction 
sequences in groups of three instruction units , and outputs 

optimized code 304 as the linked results. In this process, 

if 

local labels are calculated as PC relative values or 

I 

instruction addresses* The instruction size, which is to 
say, whether an instruction should be expressed as a one- 
unit instruction or as a two-unit instruction, is then 
determined based on the value of the PC relative value or 
the instruction address. 

Th$ assembler 305 outputs relocatable codes 306 which 
it generates from the optim i zed code 304. This processing 
converts; local labels that should be resolved with PC 

relative; values into PC relative values. 

i 

Th^ linker 307 combines a plurality of modules, That 
is, the Jinker 307 combines a plurality of relocatable 
codes 3Q£ and outputs the resulting object code 308. In 
this processing, unresolved labels are converted into PC 
relative [ values or instruction addresses. 

Th^ processor 309 executes the object code 308, 
As J described above, a program written in a high-level 
languagej is converted by the compiler 301, the optimization 
apparatus 303, the assembler 305, and the linker 307 into 
object code that is in a format executable by the 
processor . Each label in the program is converted into a 

PC relative value or an instruction address by one of the 

I 

! 
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steps in! the above procedure * Address resolution for local 
labels tbat should be resolved by a PC relative value is 
performed by the assembler 305. Address resolution for 
local labels that should be resolved by an instruction 
5 address land address resolution for external labels are 
performed by the linker 307. 

i 

The following describes the construction and 
operation of the processor 309, the linker 307, the 
assembler 305, and the optimization apparatus 303 shown in 
10 Fig. 4. ; 

W Frocessojr 

U Fisj. 6 is a block diagram showing the details of the 

processed 309 and the external memory. 
Pl5 The processor 309 is capable of executing a maximum 

Q of three ,: instructions in parallel. This processor 309 

JJ includes; calculators 401a - 401c, general registers 402, an 

upper PC | 403, a lower PC 404, an upper PC calculator 411, a 
\Q lower PC J calculator 405, an INC 412, an instruction buffer 

20 4 08, an prefetch upper counter 410, a prefetch lower 

counter 413, instruction decoder 409a - 409c, a PC relative 

i 

value selector 420, an immediate selector 421, an operand 
data buffer 423, and an operand address buffer 422. The 
external t memory includes the data memory 406 and the 
25 instruction memory 407. 

In jthe following explanation, the upper PC 403 and 

1 

the lowe^r PC 404 will be collectively referred to as the 
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"PC", ank the upper PC calculator 411 and the lower PC 

i 

calculator 4 05 will be collectively referred to as the "PC 
calculator" . 

The first calculator 401a/ the second calculator 
401b, and the third calculator 401c each perform one 
calculation. These calculators are capable of calculating 
at the same time. 

The general registers 402 store data, addresses and 
other data. 

The upper PC 403 stores the upper 29 bits of the 

i 

address jof the first instruction in a set of instructions 
to be executed in the next cycle, which is to say, a packet 
address -j 

Tiip lower PC 404 stores the lower 3 bits of the 
address jof the first instruction in a set of instructions 
to be executed in the next cycle, which is to say, an in- 
packet a,ddress. 

The instruction memory 407 stores instructions that 
are expressed by the object code 308, 

The instruction buffer 408 stores instructions that 
have been read from the instruction memory 407. 

Th.e first instruction decoder 409a, the second 
instruction decoder 409b, and third instruction decoder 
4 09c decode instructions and, if the respective 

instructions are executable, give indications to other 

i 

components in the processor to have the instructions 

i 

executecjL The first instruction decoder 409a receives an 

i 
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input of j the first instruction stored in the instruction 
buffer 468, the second instruction decoder 409b an input of 
the next! instruction, and the third instruction decoder 
409c an fnput of a next instruction. These instruction 
decoders^ 409a - 409c investigate whether there is a 
parallel- execution boundary between the instruction units 
and onlyi have the instructions that should be executed in 
the prestnt cycle executed. As one example, when an 
instruction performs a calculation using a constant, the 
constant; is sent to the first calculator 4 01a via the 

immediate selector 421 and the first calculator 401a is 

i 

instructed to perforin the calculation. For a branch 
instruction, a PC relative value is sent via the PC 
relative! value selector 420 to the lower PC calculator 405 
and uppeic PC calculator 411 that are then instructed to 
update the PC. The instruction decoders 4 09a - 409c send 
control signals showing the number of executed instruction 
units toj have the mc 412 update the PC increment, and send 
control .signals showing the number of executed instruction 

units to; the instruction buffer 408 to have the executed 

I 

instruction units deleted from the instruction buffer 408. 

The ^ relative value selector 420 outputs the PC 
relativej value outputted by the instruction decoders 4 09a - 
409c to the lower PC calculator 405 and the upper PC 
calculator 411 . 

The immediate selector 421 outputs an immediate 

outputtejd by the instruction decoders 409a - 409c to the 

i 
I 
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general registers 402 and the calculators 401a ~ 401c* 

Thi INC 412 receives information regarding the number 
of executed instruction units via control signals sent by 

the instruction decoders 409a ~ 409c 7 and increments the 

i 

value or the upper PC 403 and the lower PC 404 in 
accordance with this number. By doing so, the INC 412 sets 
the packfet address of the first instruction in the set of 
instructions to be executed in the next cycle in the upper 
PC 403 and the in-packet address of the first instruction 
in the sjet of instructions to be executed in the next cycle 
in the Ibwer PC 4 04. 

The upper PC calculator 411 and lower PC calculator 

i 

4 05 respjectively update the upper PC 403 and the lower PC 

i 

404, When a branch instruction is decoded by the 

instruction decoders 409a ~ 409c, the upper PC calculator 

411 and lower PC calculator 405 respectively receive the 

upper 2Sj bits and the lower 3 bits of the PC relative value 

i 
j 

include^ in the branch instruction of the PC relative 

value „ .The lower PC calculator 405 increases or decreases 
the present value of the lower PC 404 by the lower 3 bits 
in the PC relative value and sends the calculation result 
to the iowex PC 404 as the new lower PC. The upper PC 
calculator 411 increases or decreases the present value of 
the uppsjr PC 403 by the upper 29 bits in the PC relative 
value arid sends the calculation result to the upper PC 403 
as the riew upper PC. This operation of the PC calculators 
is described later in this specification. As described 

! 
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above, wljien a branch instruction is executed, the packet 
address £>f the branch destination instruction that is to be 
executed; next is set in the upper PC 403 and the in-packet 
address is set in the lower PC 404, There are also cases 
where the upper PC calculator 411 and lower PC calculator 
405 update the PC by calculating an address using a PC 
relative) value and an address stored in the general 
registers 402, 

The prefetch upper counter 410 shows the upper 29 
bits of the address of the first instruction in the set of 
instructions to be read from the instruction memory 407, 
which isj to say, the packet address. The prefetch upper 
counter j410 normally increments this value by one in each 
cycle. When a branch instruction was executed in the 
previou^ cycle, the packet address of the branch 
destination instruction set in the upper PC 403 is sent to 
the prefetch upper counter 410 where it is set in place of 

the present value in the prefetch upper counter 410. 

I 

Thp prefetch lower counter 413 shows the lower 3 bits 
of the address of the first instruction in the set of 
instructions read from the instruction memory 407, which is 

to say, (the in-packet address. in this embodiment, the 

i 

value "^'bOQO" is set in the prefetch lower counter 413, 
As a result, the instructions to be read are indicated in 
packet ipits, so that one packet is sent from the 
instruction memory 407 to the instruction buffer 408 in 
each cycle, 
i 
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i 
i 

Th^ data memory 406 3tores operand, data. 
Th^ operand data buffer 423 and operand address 
buffer 422 are buffers that are located between the data 
memory 406 and the processor, 
5 The following explains the incrementing method and 

calculating method for instruction addresses* This is the 

most characteristic feature of the present embodiment. 

I 

Incrementing Method for Instruction Addresses 
10 Thje incrementing of addresses is performed by adding 

; j5 an increment value to the in-packet address of an 
K instruction, and adding any carry produced by the addition 

r* to the dacket address* 

Efl Fijg* 7 is an increment table showing the rules used 

/ 15 to increment the in-packet address. As shown in the 
y figure, jwhen the in-packet address is f '3 ! b000" or "S'bOlO", 

jAl the incrementing of the instruction address is performed by 

;u adding 2 to the in-packet address. When the in-packet 

— address i is "3'blOO", a carry to the packet address ia 

20 produced (which is to say, 1 is to be added to the upper 29 
bits of ithe instruction address) and the in-packet address 
is updated to "3'bOOO", This means that the incrementing 

of the in-packet address is a calculation that cycles 

i 

through|the three values "S'bOOO", "S'bOlO", and "S'blOO". 
25 As one ^xample, when the increment value is w 2 n and the 
value of the in-packet address before incrementing is 
'^'blOO?, the packet address after incrementing is "3'b010 n 
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and a cairry of "1" to the packet address is generated. 

Note that in the present embodiment, the in-packet 
address £oes not need to be expressed in binary* This is 
especially effective when the number of instruction units 
5 in an instruction packet is not a power of 2* When this is 
the case|. it is not possible to express the position of an 

instruction unit in an instruction packet in binary and use 

i 

a binary) calculation to shift the position of an 
instruction unit. However, in the present embodiment, the 
10 position* of an instruction unit in an instruction packet is 
% expressed using ra different values. By using a calculation 

W that cycles through these m values, the specifying of 
M* xnstruct^ion units and the calculations for shifting the 

jg instruction position can be achieved even if the number of 

ff *l5 instruction units in an instruction packet is not a power 
O- of 2, 1 

ill i 

S Method tor Calculating the Instruction Address 

^ Thfo following explains the carry method which is one 

t 

20 of the zdsthods used for calculating the instruction 

addresses in the present invention. Other methods used to 
calculate addresses are a separation method, an absolute 
position indicating method, and a linear addressing method, 
though these will be described later in this specification. 

25 In the qarry method, the upper 29 bits and lower 3 bits of 

i 

an instruction address are calculated separately. However, 
when calculating the upper bits, any carry to or from the 
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upper 29| bits that occurred when calculating the lower 3 

i 

bits is taken into account* 

The following explains the aiethod by which the 
present processor adds the address of a branch instruction 

and a PC relative value to find a branch destination 

i 

address.; The lower PC calculator 405 shown in Fig. 6 adds 

the lowejr 3 bits of the address of a branch instruction to 

i 

the lower 3 bits of the PC relative value. Fig. 8A is an 

addition table showing the addition rules used when adding 

i 

the lowejr 3 bits of the address of a branch instruction to 

I 

lower 3 bits of the PC relative value. As shown in Fig* 
8A, this addition of the lower 3 -bit values differs from a 
binary calculation in being a calculation that cycles 
through ;the three values "S'bOOO", "3*b010 n , and "3^100". 
When a qarry occurs as shown in Fig. 8A, the lower PC 
calculator 405 sends the carry to the upper PC value to the 
upper PC calculator 411. 

i 

Th«e upper PC calculator 411 shown in Fig. 6 adds the 
upper 29 bits of the address of a branch instruction to the 
upper 2^ bits of the PC relative value. When doing so, if 
the calculation of the lower PC calculator 405 has resulted 
in a cai|ry to the upper PC, the upper PC calculator 411 
also adds this carry. This addition is a normal addition 
of binary values. 

The addition results of the lower PC calculator 405 
and upp^r PC calculator 411 form the address of the branch 

5 
I 

destination instruction, . The addition result for the lower 

! 
| 
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3 bits ip set in the lower PC 4 04 and the addition result 
for the tipper 29 bits is set in the upper PC 403, 

The following explains the calculations of the 
optimization apparatus 303 f assembler 305, and linker 307 
for finding the PC relative value, which is to say the 
subtraction of the branch instruction address from the 
branch destination address. Like the addition described 
above, this subtraction is performed separately for the 
upper 29 ! bits and lower 3 bits. The lower address 
subtraction means 907 of the optimization apparatus 303, 
the lower address subtraction means 806 of the assembler 
305, and the lower address subtraction means 706 of the 

linker 307 subtract the lower 3 bits of the branch 

t 

instruction address from the lower 3 bits of the branch 
destination address. Fig, SB is a subtraction table 

i 

showing [the subtraction rules used when subtracting the 
lower 3 jbits of the PC relative value from the lower 3 bits 
of a branch destination address. As shown in Fig. 8B, this 
subtraction of the lower 3-bit values differs from a binary 
calculation in being a calculation that cycles through the 
three values n 3 , bQ00 w , "3 f b010", and "3^100". When a 
carry occurs as shown in Fig. 8B, the lower address 

subtraction means that performs the calculation (such as 

f 

lower address subtraction means 907) sends the carry from 
the upper PC value to the corresponding upper address 
subtraction means (such as upper address subtraction means 

910) . The various upper address subtraction means are 

I 

i 
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describe^! in more detail later. 

Th4 upper address subtraction means 910 in the 
optimisation apparatus 303/ the upper address subtraction 
means 809 in the assembler 305, and upper address 
subtraction means 709 in the linker 307 subtract the upper 
29 bits pf the address of a branch instruction from the 
upper 29; bits of the address of the branch destination 
instruction* When doing so, if the calculation of the 
lower address subtraction means 907 (or similar) has 
resultedj in a carry from the upper PC, the upper address 
subtraction means 910 (or similar) also subtracts this 

carry. ?This subtraction is a normal subtraction of binary 

i 

values- j 

Thjese subtraction results respectively form the lower 

i 

3 bits apd the higher 29 bits of the PC relative value. 
This metjhod is also used when the processor finds the 
address jof a branch destination instruction by executing a 
subtraction on the address of a branch instruction and a PC 
relative? value - 

Thje optimization apparatus 303, assembler 305, and 
linker ioi , which calculate a PC relative value from the 
difference between the address of a branch destination 
instruction and the address of a branch instruction, and 
the processor 309, which calculates the address of a branch 
destination instruction using this PC relative value, 
calculate addresses using the same carry method. As a 
result, !when executing a branch instruction, the processor 
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can correctly calculate the address of a branch destination 
instruction from the PC relative value. This address 
calculation method that uses a carry has a feature in that 
it can calculate addresses perform separate calculations 
for uppep bits and lower bits while maintaining the 

continuity between the two, 

i 
I 

Optimization Apparatus 

Fi^. 9 is a block diagram showing the components and 
input/output data of the optimisation apparatus 303 shown 

in Fig. )5. This optimisation apparatus 303 optimizes the 

i 

assembled code 302 generated by the compiler 301, links the 
instruction sequences together in packets of three 
instruction units f and outputs the resulting optimized code 
304- Thje optimisation apparatus 303 includes a code 
optimization apparatus 902 f an address assigning means 904, 
a label jdetecting means 90S, a lower address subtraction 
means 90j7, an upper address subtraction means 910, an 

address idifference calculating means 912 r and a label 

j 

information resolving means 914. 

I 

The code optimization apparatus 902 optimizes the 
assembler code 302 and so generates the optimization 
processing code 903. This processing of the code 
optimisation apparatus 902 is the same as any well-known 
optimization apparatus, and so will not be described. 

The address assigning means 904 estimates an address 
for each instruction in the optimization processing code 
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903 produced by the code optimisation apparatus 902 and 
assigned; an estimated address to each instruction. These 
addressee are called provisional addresses in this 
specification. As a result, the address assigning means 

904 outputs the address assigned codes 916. 

The label detecting means 905 detects local labels 
from the f address assigned codes 916. On detecting a label 
that should be resolved by an instruction address, the 
label detecting means 905 obtains the provisional address 
of the instruction including this label- Conversely, on 
detecting a label that should be resolved by a PC relative 
value, the label detecting means 905 obtains the 

i 

provxsiopal addresses of the instruction including this 
label anjd the branch destination instruction. After this, 
the labejl detecting means 905 outputs the label information 
906 that; shows the instructions that include labels and 
information on values for resolving these labels* 

Thje lower address subtraction means 907, the upper 
address jsubtraction means 910/ and the address difference 
calculating means 912 calculate the PC relative values for 
labels, ;in the label information 906, that should be 
resolved by PC relative values, 

Thje lower address subtraction means 907 subtracts the 
lower 3 jbits of the provisional address of a branch 
instruction from the lower 3 bits of the provisional 
address jof the branch destination instruction and outputs 

the resulting carry value 908 and lower subtraction result 

i 
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Thte upper address subtraction means 910 subtracts the 
upper 2$ bits of the provisional address of a branch 
instruction and the carry value 908 calculated by the lower 
5 address '.subtraction means 907 from the upper 29 bits of the 
provisional address of the branch destination instruction 
and outputs the resulting upper subtraction result 911. 

Th£ address difference calculating means 912 finds 
the address difference 913 by setting the lower subtraction 
10 result 9;09 calculated by the lower address subtraction 
U means 907 as the lower 3 bits and the upper subtraction 
S result 911 calculated by the upper address subtraction 

1^ means 9lj0 as the upper 29 bits. 

D'. Thj=? label information resolving means 914 converts an 

H5 instruction in the optimization processing code 903 
j ff including the present label into an instruction of a 

jij suitable! si2e, based on an address that was estimated and 

.0 assigned by the address assigning means 904 or the address 

^ differen.ce 913 found by the address difference calculating 

20 means 912, If the assigned address or the address 

difference 913 can be expressed using no more than 13 bits, 
the labep. information resolving means 914 converts the 
instruction into a 21-bit instruction, or if not the label 
information resolving means 914 converts the instruction 
25 into a 4 2 -bit instruction. 

Af jier the labels have been resolved, the label 
information resolving means 914 links the instruction 
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sequenced into packets of three instruction units and 
outputs the result as the optimized code 304. 

The following describes a specific operation of the 
optimization apparatus 303. 

Fig. 10 is a flowchart showing the operation 

I 

procedure of the optimization apparatus. 

First, the code optimization apparatus 902 optimizes 
the assembler code 302 and generates optimization 
processing code 903. Part of the optimization processing 

code 903; generated by the code optimization apparatus 902 

i 

is shownj in Fig. 11. Of the instructions in Fig. 11/ 
"Llimov |r2,rl" 1000 shows the position of the label Ll and 
is an instruction that indicates a transfer from register 
r2 to register rl- The instruction "jsr f" is a function 
call thajt performs a relative branch to the label f (an 
external! label) . A return from the function call to this 
address lis performed by a "ret" instruction. The 
instruction "add r0,r4" adds the values of registers rO and 
r4 and sjtores the result in register r4 . The instruction 
"and rl,jr3 w 1003 calculates a logical AND for the values in 
register] rl and r3 and stores the result in register r3. 
The instruction "mov L2,r2* 1004 transfers the address of 
the instruction located at the label 1*2 into the register 

r2. The? instruction "Id (r2),rO n 1005 transfers the data 

j 

stored ajt the address stored in register r2 into the 
registerj r0 - The instruction "bra Ll" 1006 performs an 
indirect branch to the label Ll (a local label) . Note that 

i 
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in Fig* ,11, the instructions that continue after 
instruction 1007 have been omitted, though these 
instructions do not include an instruction located at the 
label f !(step S9001) . 

The address assigning means 904 assigns a provisional 
address Ito each instruction in the optimization processing 
code 903 and so generates address assigned codes 916. Fig. 
12 shows the address assigned codes 916 generated from the 
optimization processing code 903 shown in Fig. 11. In this 
example/' provisional addresses starting from the value 
"32^00000800" have been assigned (step S9002) . 

The label detecting means 905 detects local labels in 

j 

the address assigned codes 916 and outputs label 
information 906 composed of instructions that include the 
detected labels and information on the values used to 
resolve .those labels. Fig. 13 shows the label information 
906 that is generated from the address assigned codes 916 
shown iri Fig. 12- As shown in this figure, label L2 of 
instruction 1104 is detected as a label that should be 
resolved by an instruction address and label Ll is detected 
as a label that should be resolved by a PC relative value* 

Information showing the address for resolving the label L2 

\ 
\ 

is appended to the instruction "mov L2,r2" that includes 
the label L2, and information showing the addresses of the 
branch destination instruction and branch instruction to be 
used for calculating a PC relative value is appended to the 
instruction "bra Ll" that includes the label Ll, Note that 
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since thf label £ in instruction 1101 is an external label, 
it is not optimized (steps S9003 r S9004) . 

When the label information 906 includes a label that 
should be resolved by a PC relative value, processing to 
calculate this PC relative value is performed. The lower 
address pubtraction means 907 calculates the lower 3 bits 
of the value shown by the label LI that is a PC relative 
value. *The lower address subtraction means 907 subtracts 
the lower 3 bits "3^010" of the provisional address 
n 32'h000s00812" of the branch instruction 1106 from the 
lower 3 |bits "3'bOOO* of the provisional address 
"32'h000;00300" of the branch destination instruction 1100. 
As a resjult, f, l" is obtained as the carry value 908 , and 
n 3 f bl00"; is obtained as the lower subtraction result 909 
(steps §9005, S9006) . 

The upper address subtraction means 910 calculates 
the uppdr 29 bits of the value shown by the label LI that 
is a PC jrelative value* The upper address subtraction 
means 9^0 subtracts the upper 29 bits w 2 9 ' b00000102" of the 
provisional address of the branch instruction 1106 and the 
carry value 908 "1" generated by the lower address 
subtraction means 907 from the upper 29 bits "29 1 hOOOQQlOQ" 

of the provisional address of the branch destination 

i 

instruction 1100. As a result, "29'hlffff f fd" ("-3" in 
base 10; minus numbers being hereafter shown using a 
complement) is obtained as the upper subtraction result 911 
(step S9007) . 

i 
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Th^s address difference calculating means 912 finds 
the address difference , which is to say the PC relative 
value, by setting the lower subtraction result 909 as the 
lower bits and the upper subtraction result 911 as the 
upper bits. In this example, the address difference 

i 

calculating means 912 sets "S'blOO* as the lower bits and 
"29'hlf ff ff f d" as the upper bits, giving an address 
difference of "32 »hf f f f f fee" (step S9008) . 

The label information resolving means 914 judges 
whether ;the v^l-ae used to resolve the label in the label 
information 906 can be expressed by a 13-bit value* The 
value that resolves the label L2 shown in Fig* 13 is 
"32 ^123145678", so that this value cannot be expressed as a 
13-bit rptlue, meaning that instruction 1104 including this 
label L2 will become a 42-bit instruction. On the other 
hand, the value used to resolve label Ll is "32 'hf ff f f fee", 
which cajn be expressed by a 13-bit value. Accordingly, the 
instruction 1106 that includes label Ll will become a 21- 
bit instruction (steps S9Q09, S901G, S9011) - 

The label information resolving means 914 links the 
instruction sequences into packets of three instruction 
units, based on the address assigned codes 916 . When doing 
so, the jlabel information resolving means 914 converts 
instructions that include labels into instructions of the 
determined size* Here, one instruction unit is used for 
21-bit instructions, and two units are used for 42-bit 
instructions. After this, the label information resolving 
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means 91 f outputs the instruction sequences that it has 
converted into packets as the optimized code 304- Fig, 14 
shows the optimised code 304 generated from the address 
assigned' codes 916 shown in Fig, 12* In Pig. 14, each row 

5 shows the instructions that form one instruction packet , 
with the] marks "M w showing the boundaries between 
instructions in a packet* Curved brackets are used in 

this drawing to indicate 42-bit instructions that each 
occupy two units (step S9012) , 

10 As; described above, addresses are estimated with a 

calculation method that uses a carry . In this way, a 
suitable 1 optimization apparatus for a processor that uses a 

carry mefthod can be achieved* 

i 

Nope that the provisional addresses assigned by the 
15 address jassigning means 904 and the PC relative values 

calculated by the address difference calculating means 912 
are values that are estimated for determining the sizes of 
all instructions that include labels. There are cases when 
these estimates differ from the actual values, that 
20 these values are not used hereafter in the processing. 

i 

Assembler 

Fi#. 15 is a block diagram that shows the 
construction of the assembler 305 shown in Fig. 5 and the 
25 input/output data related to the assembler 305. This 

assembler 305 converts the optimized code 304 generated by 

! 

the optimization apparatus 303 into relocatable codes 306 

j 
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that have a relocatable address format* The assembler 305 

i 
I 

includes" a machine language code generating means 802/ a 
label detecting means 804/ a lower address subtraction 
means 806/ an upper address subtraction means 809, an 
address difference calculating means 811 # and a label 
information resolving means 813. The machine language code 
generating means 802 converts the optimised code 304 into 
machine ^Language codes 803 that can be executed by the 
processor 309- However/ labels whose values have not been 
resolvedi are not converted and are stored in the machine 
language^ codes 8 03 as they are* The machine language code 

generating means 802 assigns a packet address and an in- 

i 

packet ajddress to each machine language code. As described 
later, tjhe labels are later resolved using these addresses. 

Thp label detecting means 804 finds a label that 
should bje resolved by a PC relative value, which is to say, 
a difference in addresses between two instructions and 
obtains { the addresses of the branch instruction and the 
branch destination instruction. After this, the label 
detecting means 804 outputs label information 805 that is 
composed of the instructions that include labels and the 

values tihat resolve these labels. 

i 

To resolve the label information 805 obtained by the 
label detecting means 804, the lower address subtraction 
means 806, the upper address subtraction means 809, and the 
address ; difference calculating means 811 calculate a PC 
relative* value as follows* 
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Thi lower address subtraction means 806 subtracts the 

t 

lower 3 fc>its of the address of a branch instruction from 
the lower 3 bits of the address of the branch destination 
instruction and outputs the carry value 807 and the lower 
subtraction result 808. 

The upper address subtraction means 809 subtracts the 

upper 29 bits of the address of a branch instruction and 

i 

the carry value 807 calculated by the lower address 
subtraction means 806 from the upper 29 bits of the address 
of the btanch destination instruction and outputs the 
resulting upper subtraction result 810. 

The address difference calculating means 811 finds 

i 

the address difference 812 by setting the lower subtraction 
result 8*08 calculated by the lower address subtraction 
means 80j6 as the lower 3 bits and the upper subtraction 
result &10 calculated by the upper address subtraction 
means 80;9 as the upper 29 bits* 

Thp label information resolving means 813 replaces 
the labels in the machine language codes 803 with the 
address ^differences 812 calculated by the address 
difference calculating means 811, and outputs the resulting 
relocatable codes 306. 

Thb following explains a specific example of the 
processing of the assembler 305 on receiving an input of 
the optimized code 304 of Fig, 14 that has been outputted 
by the optimization apparatus 303. 

Fig. 16 is a flowchart showing the operation of the 
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assembler . 

First, the machine language code generating means 802 
converts, each packet in the optimized code 304 into machine 
language codes 803 that are suited to the processor 309, 

5 However, the ma chine language code generating means 802 
does not; convert labels whose values have not been 
resolved^ so that these labels are stored as they are in 
the machine language codes 803. After this, the machine 
language code generating means 802 assigns packet addresses 

10 (hereafter also called "local packet addresses") and in- 
packet addresses to each instruction in the machine 
language! codes 803. Fig. 17 shows the machine language 
codes 803 that are generated from the optimised code 304 
shown in Fig- 14, Note that the actual machine language 

15 codes ar*e expressed in binary as sequences of zeros and 
ones, though for ease of understanding these machine 
language codes are shown in Fig, 17 in mnemonic form. The 
parallel; execution boundary information 100 and the format 
information 101 will also be clear at this stage, but are 

20 not illustrated to simplify the figure* In Fig. 17, packet 
addresses (local packet addresses) are assigned starting 

from the value "29 ' hOOOQOOOO" » The label f in the 

i 

instruction "jsr f" in packet 1300, the label L2 in the 
instruction *mov L2,r2" in packet 1301, and the label LI in 
25 the instruction "bra LI" in packet 1302 have not yet been 
resolved, sq that these instructions are not converted 
(steps &150Q, S1501) * 
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Nea^t, the label detecting means 804 detects labels, 

i 

out of the unresolved labels in the machine language codes 
803, which are local labels that should be resolved by a PC 
relative ! value/ and obtains the address of the instruction 
including the label, which is to say, the branch 
instruction, and the address of the branch destination 
instruction. The label detecting means 804 then outputs 
label information 805 that includes information showing the 
instruction including the label and the value that resolves 
the label * Fig. 18 shows the label information 805 that is 
generated from the machine language codes shown in Fig, 17* 
Here, label Ll is detected as a local label that should be 
resolved! by a PC relative value, "32^00000012" is obtained 
as the ajddress of the branch instruction, and 
"32 l hO00 | OO00O w is obtained as the address of the branch 
destination instruction (steps S1502, S1503) . 

Thp lower address subtraction means 8 06 then 
calculates the lower bits of the value Ll that is a PC 

relative value. The lower address subtraction means 806 

[ 

subtracts the lower 3 bits "3'bQlO" of the address 
"32'h00q000l2" of the branch instruction 1409 from the 
lower 3 .bits "3'bOOO" of the address "32 'hOOOOOOOO" of the 
branch destination instruction 1401. As a result, "1" is 
obtained; as the carry value 807 and "3*bl00" is obtained as 

i 

the lower subtraction result 808 (step S1504) . 

Nejxt, the upper address subtraction means 809 
calculates the upper bits of the value Ll that is a PC 
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relative! value- The upper address subtraction means 809 
subtracts the upper 29 bits "29 'h00000002" of the address 
of the branch instruction 1409 and the carry value 807 "1" 
from the upper 29 bits "29 ' hOOOOOOOO" of the address of the 
branch destination instruction 1401. As a result, 
"29'hlffff ffd" ("-3" in base 10, minus numbers being 
hereafter shown using a complement) is obtained as the 
upper subtraction result 810 (step S15Q5) . 

The address difference calculating means 811 finds 
the address difference, which is to say the PC relative 
value, by setting the lower subtraction result 808 as the 
lower bits and the upper subtraction result 810 as the 
upper bits. In this example, the address difference 
calculating means 811 sets "3'bl00 w as the lower bits and 
"29'hlffjffffd" as the upper bits, giving an address 
difference of "32 'hf ff ff fee" (step S1506) . 

The label information resolving means 813 judges 
whether the address difference 812 can be expressed by only 
its lower 13 bits- 1£ so, the label information resolving 
means 813 sets the lower 13 bits of the address difference 
812 as tjhe PC relative value, or if not, the label 
information resolving means 813 sets the entire address 
difference 812 as the PC relative value. As a result, a 
label in the machine language codes 803 is converted into a 
PC relative value. The address difference that resolves 
label LI in the label information in Fig. 17 is 
"32 *hffffffec", which can be expressed by the lower 13-bit 
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value "131fec n f so that the label LI in the machine 

! 

language! codes shown in Fig* 17 is converted into the lower 
13-bit v^lue. Fig. 19 shows the relocatable codes that are 
generatek from the machine language codes 803 shown in Fig* 

5 17 * In tig, 19/ the instruction 1609 has been produced by 
converting the label LI into a PC relative value* Fig. 19 
shows th^ parallel execution boundary information 100 and 
format information 101 of each instruction that had already 
been established when the machine language codes 803 were 

10 outputteh, and also shows the unused bit in each 
instruction packet (steps S1507, S1508, S1509) , 

As , described above, by finding a PC relative value by 
performing address calculation according to a carry method, 
an assembler corresponding to a processor that uses a carry 

15 method c^n he realized. 

Linker 

Fi^j. 20 is a block diagram showing the construction 

o£ the linker 307 shown in Pig- 5 and the I/O 

20 {input/ op tput) data of the linker 307. This linker 307 

combines- a plurality of relocatable codes 701, determines 
the addrpsses of each instruction, and outputs the object 
code 714} that is executable by the processor 309 and is in 
absolute; address format. The linker 307 includes the code 
combining means 702, the relocation information detecting 
means 7Qj4, the lower address subtraction means 706, the 
upper address subtraction means 709, the address difference 
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J 

I 

calculating means 711/ and the relocation information 

i 

resolvinjg means 713. 

3 

The code combining means 702 combines a plurality of 
inputted relocatable codes 701 and determines the addresses 
5 of all instructions. The code combining means 702 then 

resolves* the labels that should be resolved by instruction 
addresses using the determined addresses and outputs the 
combined codes 703 that result from its operation- 

The relocation information detecting means 704 

□ 10 searches^ for external labels that should be resolved by PC 
y relatives addresses and obtains the addresses of branch 

H instructions and the branch destination instructions* 

2 After doing so, the relocation information detecting means 

C 704 outputs relocation information 705 includes information 

□ 15 showing instructions that include labels and values to be 
:ft used to ^resolve the labels* To resolve the resulting 

to relocation information 705, the lower address subtraction 

;0 means 70;6, the upper address subtraction means 709, and the 

address difference calculating means 711 calculate PC 
20 relative; values, as described below. 

The lower address subtraction means 706 subtracts the 

lower 3 bits of the address of the branch instruction from 

> 

the lower 3 bits of the address of the branch destination 
instruction, and so generates a carry value 707 and a lower 

i 

25 subtraction result 708. 

The upper address subtraction means 7 09 subtracts the 

i 

upper 2SJ bits of the address of the branch instruction and 
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i 

I 

1 

the carry value 707 generated by the lower address 
subtraction means 706 from the upper 29 bits of the address 
of the branch destination instruction, and so generates the 
upper subtraction result 710. 
5 The address difference calculating means 711 sets the 

lower subtraction result 708 calculated by the lower 
address subtraction means 706 as the lower 3 bits and the 
upper subtraction result 710 calculated by the upper 
address subtraction means 7 09 as the upper 29 bits to 
(3 10 generate the address difference 712* 
'{a Thf relocation information resolving means 713 

U replaces 1 labels in the combined codes 703 with address 

III differences 712 calculated by the address difference 

^ calculating means 711, and outputs the resulting object 

15 code 308 r 

P Thp operation of the linker 307 is explained below 

|g using an| example where the relocatable codes 306 shown in 

|£ Pig, 19 jthat have been outputted by the assembler 305 have 

bean inputted, 

I 

50 Fi^, 21 is a flowchart showing the operation of the 

linker 307. 

Fijrst, the code combining means 7 02 combines a 
plurality of relocatable codes 701. Fig, 23 shows the 
state when the relocatable codes 814 shown in Fig. 19 have 
25 been coitjbined with the relocatable code shown in Fig, 22, 

The code combining means 702 combines these relocatable 

i 

codes with the packet address of the first relocatable code 

i 



in Fig. 22 as "29 f h00000000 w and the packet address of the 

i 

first relocatable code in Fig, 19 as "29'h00000001 n (step 
S2000, S2001) . 

The addresses of all instructions are determined in 
this way 7 so that the code combining means 702 can resolve 
the addresses of labels that should be resolved by 
instruction addresses and then output the resulting 
combined^ codes 703. Fig, 23 shows that the address of 
label L2, in instruction 1810 "mov L2,r2" is the starting 
address bf instruction packet 1815* This address has been 

i 

set at "32*3112345680", so that the code combining means 702 
uses thi;S value to replace the label L2 * Fig. 24 shows the 
resulting combined codes 7 03, In instruction 1910 in Fig, 
24, the label L2 has been replaced with this address 
"32^12345 680" (step S2002) . 

Ne?ct, the relocation information detecting means 704 
finds external labels in the combined codes 703 that should 
be resolved by PC relative values and extracts the 
$.ddr&&s&s of the instructions that include these labels and 
the addresses of the instructions where these labels are 
located,; which is to say, the addresses of branch 
instructions and branch destination instructions. After 
this, the relocation information detecting means 704 
outputs relocation information 705 that is composed of 
information showing the instructions including labels and 
the values to be used to resolve these labels- Fig. 25 
shows the label information that is generated from the 



combined codes 703 shown in Fig- 24* Here, label f is 
found as^ an external label that should be resolved by a PC 
relative value, so that "32 'hOQOOOOOa" is obtained as the 
address of a branch instruction and "32 'hOOGOOQOO* as the 
5 address of the branch destination instruction (steps S2Q03, 
S2004) . i 

i 

The lower address subtraction means 706 then 

i 

calculates the lower bits of the value f that is a PC 
relative value* The lower address subtraction means 706 
io subtracts the lower 3 bits "3'bGlO" of the address 

n 32 , h000i0000a ?? of the branch instruction 1905 from the 
lower 3 jbits "3 T b000" of the address "32 T h00000000" of the 
branch destination instruction 1901* As a result, "1" is 

obtained as the carry value 707 and tt 3 T bl00 n is obtained as 

i 

15 the lower subtraction result 708 (step S2Q05K 

Nej&t, the upper address subtraction means 709 
calculates the upper bits of the value f that is a PC 
relative value. The upper address subtraction means 709 
subtracts the upper 29 bits "29 ' 1^00000002" of the ^ddr^ss 
20 n 32 , h0000000a" of the branch instruction 1906 and the carry 
value 7(^7 "1" from the upper 29 bits "29^00000000" of the 
address jof the branch destination instruction 1901* As a 
result, i "29 , hlffffffe w is obtained as the upper subtraction 
result 710 {step S2006) . 

Thie address difference calculating means 711 finds 
the address difference 712, which is to say the PC relative 
value, by setting the lower subtraction result 708 as the 
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lower "bits and the upper subtraction result 710 as the 
upper bits* In this example, the address difference 
calculating means 811 sets "S'blOQ* as the lower bits and 
"29 'hlf f f f ff e" as the upper bits, giving an address 
difference of "32 ' hff f f f f f 4" (step S2007) . 

Next, the relocation information resolving means 713 
converts' a label in the combined codes 7 03 into a PC 
relative? value, setting the lower 13 bits of the address 
difference 712 as the PC relative value if this address 
difference 712 can be expressed by the lower 13 bits, or 
otherwise setting the entire address difference 712 as the 
PC relative value. The address difference that resolves 
the labejl f in the relocation inf ormation in Pig. 24 is 
"32 'hff fjf ff £4", which can be expressed by the lower 13-bit 
value "1,3 'hlf f 4", so that the label f in the combined codes 
703 showjn in Fig. 23 is converted into this lower 13~bit 
value toj produce the object code, The resulting object 
code is jshown in Fig, 26, In instruction 2106 in Fig. 26, 
the label f has been converted into the lower 13-bit value 
*13 f hlf^4" (steps S2008, S2009, S201Q) - 

As^ described above f the present linker finds PC 
relative values using an address calculation including a 
carry, and so is suited to a processor that uses a carry. 

Specific Operation of the Processor 

The following describes the operation of the 
processor when the object code shown in Fig, 26 has been 
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stored in the instruction memory 407- 

At!the start of execution of this object code, the 
upper PC| 403 is set at "29 'hOOQOOQOO" and the lower PC 404 
is set at "3^000". The prefetch upper counter 410 
receives- an input from the upper PC 403 and so is set at 
"29^00000000". 

The read of instructions from the instruction memory 
407 is performed in packet units according to the value in 
the prefetch upper counter 410. In detail, -instruction 
packet 2100 that is indicated by the prefetch upper counter 
410 is rjead from the instruction sequence stored in the 
instruction memory 407 and is stored in the instruction 
buffer 4p8. The value of the prefetch upper counter 410 is 
incremented by one in each cycle, and so here becomes 
"29^000:00001"- Hereafter r an instruction packet indicated 
by the prefetch upper counter 410 is read from the 
instruction memory 407 and written into the instruction 
buffer 4.08 in each cycle. 

The following explains fclie operations for decoding 

and executing instructions for the case when instruction 
packet 2104 is indicated by the upper PC 403 and 
instruction 2107 in instruction packet 2104 is indicated by 
the lower PC 404* The instructions stored in the 
instruction buffer 408 are interpreted by the instruction 

decoders 409a ~ 409c, The first instruction decoder 409a 

l 

receives an input of the first unit, unit 2107, in the 
instruction packet 2104 and investigates whether unit 2107 

m 



is a oneyunit instruction and whether there is a parallel 

i 

execution boundary. Since unit 2107 is a one-unit 
instruction and there is no parallel execution boundary, 
the second instruction decoder 409b receives an input of 
the next" unit, unit 2109, and investigates whether unit 

i 

2109 is £ one-unit instruction and whether there is a 
parallel; execution boundary* Since unit 2109 is a one-unit 
instruction and there is no parallel execution boundary, 
the third instruction decoder 4 09c receives an input of the 
next unit: and investigates whether this nejst unit is a one- 
unit instruction and whether there is a parallel execution 
boundary,- Since this unit is not a one-unit instruction, 
the thirb. instruction decoder 409c also receives an input 
of the fpllowing unit* The third instruction decoder 409c 
then finds that this following unit includes a parallel 
execution boundary. As a result, the instructions 2107, 

i 

2109, and 2110 are executed in parallel • 

Th£ first instruction decoder 4 09a decodes the 
instruction "add r0,r4" and outputs control signals to the 
first calculator 401a* The first calculator 401a adds the 
values e>£ registers rO and r4 and stores the result in 
registerj r4. The second instruction decoder 409b decodes 
the instruction "and rl,r3" and outputs control signais to 
the second calculator 401b, The second calculator 4 01b 
performs^ a logical operation on the values of registers rl 
and r3, and stores the result in register r3. The third 
instruction decoder 409c decodes the instruction "mov 
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32 l hl234^680,r2 ff and so has the immediate "32 'hl2345680 n 
transferred into register r2* 

In' this case, the instruction decoders 409a ~ 409c 
inform the INC 412 that a total of four instruction units 
5 have been executed ♦ The INC 412 increments the values in 
upper PCj 403 and the lower PC 404 by four units* As a 
result, the lower PC 4 04 becomes "S'bOOO", a carry of two 
to the upper PC 403 is generated, and the upper PC 403 

becomes "29 f 1x00000003" . This means that the first 

i 

10 instruction to be executed in the next cycle is instruction 
2112. [ 

The first instruction decoder 409a receives an input 
of the first unit, unit 2112, and investigates whether unit 

2112 is a. one-unit instruction and whether there is a 

15 parallel^ execution boundary. Since unit 2112 is a one-unit 
instruction and there is no parallel execution boundary, 
the second instruction decoder 409b receives an input of 
the next unit, unit 2113, and investigates whether unit 

2113 is a one -unit instruction and whether there is a 

20 parallel execution boundary. Here, the second instruction 

decoder ;409b finds that unit 2109 is a one-unit instruction 
and that; there is a parallel execution boundary. As a 
result, the processor 309 finds that instructions 2112 and 
2113 cai} be executed in parallel. 

25 The first instruction decoder 409a decodes the 

instruction "Id (r2),rQ", has the operand data, which has 
the value in register r2 as the operand address, read from 



the dataj memory 406 and stored in register rO- The second 
instruction decoder 40 9b decodes the instruction "bra 
13'hlfecrr and, since this is a branch instruction, updates 
the valufes in the upper PC 403 and lower PC 404 using the 
address of the branch destination instruction. 

First, the address indicated by the upper PC 403 and 
lower PC: 404 is amended. While a PC relative value shows 
the difference in addresses between a branch instruction 
and its branch destination instruction, the upper PC 403 
and lower PC 4 04 show the address of the first address to 
be executed in the same cycle as the branch instruction, so 
that the upper PC 403 and lower PC 404 are amended so that 
they indicate the address of the branch instruction* In 

detail, (the INC 412 increments the values of the upper PC 

i 

403 and jlower PC 404 by one unit to show that the branch 
instruction 2113 is preceded by one instruction unit, the 
first instruction 2112* As a result, the lower PC 4 04 
becomes pS'bOlO 1 ' and the upper PC 4 03 stays at 
tT 29 , hO0QO0003 1 *. 

Following this, the upper PC calculator 411 and the 
lower PC calculator 405 add the PC relative value 
"IS'hlfec" obtained by the second instruction decoder 409b 
to the upper PC 403 and the lower PC 404. Here, the sign- 
extended 32-bit value "32 f hf f f f f fee" is used as the PC 
relative* value. This addition is split into additions of 
the upper 2 9 bits and the lower 3 bits. 

Thje lower PC calculator 405 adds the lower 3 bits 
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"3 f falOO" j of the PC relative value to the value n 3 f fa010 M of 
the lower EC 404. As a result, a carry of one and the 
lower calculation result "3*b000 n are obtained, The lower 
PC calculator 405 sends the carry to the upper PC 
calculator 411, and sends the lower calculation result to 
the lowep PC 404. 

Next, the upper PC calculator 411 adds the upper 29 

i 

bits "29 !, hlfff fffd" of the PC relative value and the carry 
value "IV received from the lower PC calculator 405 to the 
value *'29 , h00000003 w of the upper PC 403. The upper PC 
calculator 411 sends the upper calculation result of 
"29^000^00001" to the upper PC 403, which sends the value 
on to the prefetch upper counter 410. As a result of this 
processing, the prefetch upper counter 410 is set at 
ff 29 , h000;00001 w , so that the next instruction packet to be 
prefetched will be instruction packet 2104. Also, since 
the uppejr PC 403 is "29 T h000Q0001" and the lower PC 404 is 
"S'bQQO"', the first instruction to be executed in the next 
cycle is instruction 2105. 

Hereafter, codes in the object code are successively 

read and executed in the same way, so that no explanation 

I 

will be [given for the other instructions. 

This completes the detailed explanation of the 
construqtions of the processor 309, linker 307, assembler 
305 and j optimization apparatus 303 shown in Fig. 5. A 
conventional compiler can be used as the compiler 301, so 
that no ; explanation of such will be given. 
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Not^e that while the processor of this embodiment 
includes- three instruction decoders 409a - 409c and three 
calculators 401a ~ 401c, the present invention is not 
limited to this construction, so that only one instruction 
5 decoder &nd one calculator may by provided. It is also 
possible; for the functions of the optimisation apparatus 
303 to be incorporated into the compiler 301, and to have 
the object code 308 generated from the source code 300 by 
the compiler 301, the assembler 305, and the linker 307. 
10 Inj the present embodiment, the prefetch lower counter 

413 was described as having the fixed value of "S'bOOO", 
though this need not be the case. As one example, this 

value majy be incremented by one in each cycle. This 

I 

results jin one byte of data being read from the instruction 
15 memory 4 } 07 and written into the instruction buffer 4 08 in 
each cyqle. 

i 

Second Embodiment 

The second embodiment of the present invention 
20 relates to a modification of the processor, optimisation 
apparatus, assembler, and linker of the first embodiment. 

This modification uses a different value as the PC relative 

i 

value fq>r resolving labels in branch instructions. 

In the first embodiment, the PC relative value in a 
25 branch instruction is a difference in addresses between the 
branch instruction and the branch destination instruction, 
while in this second embodiment, the PC relative value in a 
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branch instruction is a difference between the address of 
the branch destination instruction and the address of the 
first instruction in same set of instructions as the branch 
instruction . 

In [ this way, the PC relative value has a slightly 
different meaning than in the present embodiment. However, 
if the devices used to generate a program (i.e., the 
optimization apparatus 303, assembler 305, and linker 307 
that calculate the PC relative value) use the same meaning 
as the device that executes the program (i«e*, a processor 
that calculates an address based on the PC relative value), 
the processor will be able to correctly change the program 

i 

counter jto the address of a branch destination instruction 
when executing a branch instruction. 

Thp following explains the optimization apparatus 
303, assembler 305, linker 307, and processor. 

Thb label detecting means 905 of the optimization 
apparatus 303 generates the label information 906 for 
labels -that should bo resolved by PC relative values in the 
following way* Instead of generating label information 
after obtaining the provisional addresses of the branch 
instruction and the branch destination instruction in the 
same way as in the first embodiment, the label detecting 
means 905 generates the label information 906 after 
obtaining the provisional addresses of the branch 
destination instruction and the address of rhe first 
instruction in the same set of instructions as the branch 
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I 

instruction. In the same way as in the first embodxment, 
this label information 906 is then used to calculate the 
address difference 913 that is the difference between two 
provisional addresses and is used in the optimized code 

i 

5 304. The assembler and linker also operate in this way- 
The following describes a specific example of the 
object code 308 generated in this embodiment . 

Th£ assembler 305 replaces the label LI in 
instruction 1409 in the machine language codes shown in 
;sa . 10 Fig. 17 with the subtraction value "ISh'IffO" produced by 
ff subtracting the address "32 f h00000010 n of instruction 1408, 

□ which is! the first instruction in same set of instructions 

II as instruction 1409, from the address w 32'h00000000 n of the 

J branch destination instruction. In the same way, the 

15 linker 3j07 replaces the label f in instruction 1906 in the 
combined codes shown in Fiq. 24 with the subtraction value 
? "IB^lffiS 11 produced by subtracting the address 

3 "32^000(00008" of the instruction 1907, which is the first 

instr-uetjion in same set of instructions as instruction 
20 1906, fijom the address n 32 'hOOOOOQOO" of the branch 
destination instruction. Fig. 27 shows that the PC 
relative^ value of instruction 2213 differs from that shown 
in Fig. j26. 

The following describes the processor of the present 
25 embodiment « 

i 

The processor 309 executes object code that have been 

I 

generated as described above. When the processor 309 
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executes I a branch instruction, the PC relative value in the 
branch instruction is a difference in addresses between the 
branch destination instruction and the first instruction in 
same set' of instructions as the branch instruction. 
Accordingly, the processor 309 does not amend the values of 
the uppeir PC 403 and lower PC 404 , and/ in the same way as 
in the first embodiment, adds the PC relative value to the 
values in the upper PC 403 and lower PC 404 and updates the 
values in the upper PC 403 and lower PC 404 using the 
additionj results* When this processor 309 executes the 
object cjode shown in Fig, 27, the execution of instruction 
2213 results in the PC relative value "13hlff8 n being added 
to the present PC "32 1 hOOOGOOOS", resulting in the PC being 
updated jto w 32 , h0O00OO00 B * 

As! described above, the processor of the present 
embodiment does not need to amend the value of the program 
counter jin the same way as in the first embodiment whenever 
a branchy instruction is executed. The address of a branch 
destination instruction can instead be obtained by directly 
adding cj PC relative value to the PC. This reduces the 

total execution time. 

» 

! 

Third Embodiment 

Th|e third embodiment of the present invention relates 
to a processor that can indicate the execution position of 
an instruction by fully utilizing the lower 3 bits of 
instruction addresses. 

I 
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In the first embodiment, the lower 3 bits of the 
instruction address are used to indicate a position that is 
one of three units- In the present embodiment, however, 
full use I is made of these 3 bits by having them indicate 
5 one of eight units. 

Fig. 28A shows the construction of an instruction 
packet ih the present embodiment. This instruction packet 
is composed of eight instruction units- Each instruction 
unit in an instruction packet is 8 bits long, so that the 
10 total length of one instruction packet is 64 bits. The 
y processor in this embodiment reads one instruction packet 

W (64 bits;) in one cycle, 

U ; 

Fi<j. 2 SB shows the types of instructions used in thxs 

m embodiment. Each instruction is composed of 8-bit 

i>-j i 

Hs instruction -units, with there being one-, two-, three-, 

Q four-, f;ive-> and six-unit instructions. 
|| Fig, 28C shows the relation between in-packet 

% addressees and the instruction units in a packet. In the 

& same wci^j as in the first embodiment, a position in an 

20 instruction packet is indicated by the lower 3 bits of an 

instruction address « As shown in Fig* 28C, the in-packet 

i 

address j"3 f b000" indicates the first unit, the in-packet 
address ^ n 3'b001" indicates the second unit, the in-packet 
address \ "3 "bOlO" indicates the third unit, the in-packet 
25 address j "3^011" indicates the fourth unit, the in-packet 

i 

address ►"S'blQO" indicates the fifth unit/ the in-packet 

address ^3^101" indicates the sixth unit, the in-packet 

i 
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address "3^110" indicates the seventh unit, and the in- 
packet address "3 'bill" indicates the eighth unit. 

As described above, the processor of the present 
embodiment indicates the execution position of an 
instruction making full use of the lower 3 bits of the 
instruction address. As a result/ instructions can be 
executed! with a greater variation of execution units for 
one cycle, 
i 

Fourth Embodiment 

The fourth embodiment of the present invention 
relates to a method for calculating instruction addresses 
without p-sing a carry* 

Th£ first embodiment teaches a processor for 

i 

executing a program, and an optimization apparatus, 
assembler, and linker for generating a suitable program. 
All of these devices use a common method for calculating an 
instruction address using a carry. This has the effect 
that the processor can correctly generate the address of a 
branch destination instruction using a PC relative value. 
However, this effect can be achieved if the processor, 
optimization apparatus , assembler, and linker use a common 
address jcalculation method that does not use a carry- 
This present embodiment relates to such a calculation 
method that calculates addresses without using a carry. 

This calculation method that does not use a carry 
resembles the calculation method in the first embodiment in 

* 78 



that the calculation of address is performed separately for 
the upper 29 bits and lower 3 bits. However, the present 
method differs by not using a carry. 

The following explains the method by which the 
processor finds the address of a branch destination 
instruction by adding the address of a branch instruction 
and a PC relative value. The lower PC calculator 405 shown 
in Fig. & adds the lower 3 bits of the address of the 
branch instruction and the lower 3 bits of the PC relative 
value, jFig. 29A is an addition table showing the addition 
rules for adding the lower 3 bits of the address of the 
branch instruction and the lower 3 bits of the PC relative 
value in the present calculation method. As shown in the 
figure, ithis calculation differs from a normal addition of 
binary Values in that it cycles between the three states 
"3'b000";, n 3'b010", and M 3'bl00". Note that no carry is 
generated. 

Thp upper PC calculator 411 shown in Fig* 6 adds the 
upper 2 9f bits of the 3i<ldi?:&s& of the branch instruction and 
the uppejr 29 bits of the PC relative value. This is a 
normal addition of binary values. 

Thp results of the above additions form the address 
of a branch destination instruction. In detail, the 
addition result for the lower 3 bits is set in the lower PC 

\ 

404 and ithe addition result for the upper 29 bits is set in 

i 

the uppesr PC 403, 

The following explains the method used by the 
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optimization apparatus, assembler/ and linker to calculate 

the PC relative value, which is to say, to subtract the 

j 

address of -the branch destination instruction from the 
address of the branch instruction. This subtraction is 
split into an upper 29 bits and lower 3 bits like the 
addition* performed by the processor* The lower address 
subtraction means 907 of the optimization apparatus 303, 
the lower address subtraction means 806 of the assembler 
305, and the lower address subtraction means 706 of the 
linker 307 subtract the lower 3 bits of the address of a 
branch instruction from the lower 3 bits of the address of 
the branch destination instruction. Fig. 29B is a 
subtraction table showing the subtraction rules- for 
subtracting the lower 3 bits of the address of the branch 
instruction from the lower 3 bits of the address of the 
branch destination instruction. As shown in the figure, 
this calculation differs from a normal subtraction of 
binary values in that it cycles between the three states 
"3 , b000 ,, f, "3^010", and "3 T bl00 M > Note that no carry is 
generated. 

Thje upper address subtraction means 910 of the 
optimisation apparatus 303,, the upper address subtraction 
means 809 of the assembler 305, and the upper address 
subtraction means 709 of the linker 307 subtract the upper 
29 bits jof the address of the branch instruction from the 
upper 29 bits of the address of the branch destination 
instruction. This is a normal subtraction of binary 



values. 

The PC relative value is then found by setting the 
result of the above subtraction for the lower 3 bits as the 
lower 3 bits and the result of the above subtraction for 
the uppej: 29 bits as the upper 29 bits. 

Fig. 30 shows the object code that is generated by 
the above address calculation method of the present 
embodiment that does not use a carry. The PC relative 
values of instructions 2406 and 2413 differ to those in 
Fig. 26.' The following explains the calculation of the PC 
relative value of instruction 2406. 

The lower address subtraction means 706 subtracts the 
lower 3 jbits "3'bOlO" of the address of instruction 2406 
from the| lower 3 bits "3'bOOO" of the address of 
instruction 2401 in accordance with the subtraction table 

i 

shown in! Fi 9* 29B * This Produces the lower subtraction 
result "i3'bl00". 

Thje upper address subtraction means 709 subtracts the 
upper 29 bits "29 'hQOOQOOOl" of the address of instruction 
2406 from the upper 29 bits "29 'hOOOOOOOO" of the address 
of instruction 2401. This produces the upper subtraction 
result "29 , hlfffffff. 

The address difference calculating means 711 
generates the address difference "32 'hlf f ffff c n by setting 
the upper subtraction result "29'hlf f ff fff " as the upper 29 
bits and the lower subtraction result "3'blOO" as the lower 

i 

3 bits. I 



The relocation information resolving means 713 judges 
that the, address difference "32 f hlf f f f f f c" can be expressed 
by just the lower 13 bits "lS'hlffc" and so replaces a 
label with this value "13 f hlffc H as a PC relative value to 
generate instruction 2406* 

The processor 309 executes the object code generated 
as described above* When executing a branch instruction, 

i 

the processor 309 adds the upper PC 403 and lower PC 4 04, 
which have been amended to correctly indicate the branch 
instruction, to the PC relative value in the branch 
instruction without generating a carry. 

When the processor 309 executes instruction 2406 in 
the objejct code shown in Fig. 30/ the lower PC calculator 
405 adds the amended lower PC 404 "3^010" and the lower 3 
bits "3^100" of the PC relative value and updates the 
lower PC 404 to the resulting addition value "3 ? b000". The 
upper PCf calculator 411 adds the amended upper PC 403 
,, 29 , hQO<fQ0001 n and the upper 29 bits "29 »hlf f f f f f f " of the 
PC relative value and updates the lower PC 404 to the 
resulting addition value "29 'hOQOOQQOO" . 

As described above, the present calculation method 
can calculate addresses without a carry being sent between 
the lower PC calculator 4 05 and the upper PC calculator 

411. This means that address calculation can be performed 

i 

with a simpler hardware construction. 
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Fifth Embodiment 

The fifth embodiment of the present invention teaches 
a method for calculating instruction addresses using 
absolute values. 

This calculation method that uses absolute values 
resembles the calculation method in the first embodiment in 
that the calculation of address is performed separately for 
the upper 29 bits and lower 3 bits. However, the present 
method differs from the carry method in that the value of 
the lowex 3 bits of an instruction address are set as the 
lower 3 bits of the calculation result* 

The following explains the method by which the 
processojr finds the address of a branch destination 
instruction by adding the address of a branch instruction 
and a PQ relative value. The lower PC calculator 405 shown 
in Fig. 6 adds the lower 3 bits of the address of the 
branch instruction and the lower 3 bits of the PC relative 
value. Fig, 31A is an addition table showing the addition 
rules fdr adding the lower 3 bits of the address of the 
branch instruction and the lower 3 bits of the PC relative 
value iii the present calculation method that uses absolute 
values. , As shown in the figure, the lower 3 bits of the PC 
relative value are set as the lower 3 bits of the addition 
result* . 

The upper PC calculator 411 shown in Fig. 6 adds the 
upper 29 bits of the address of the branch instruction and 
the upper 29 bits of the PC relative value. This is a 
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normal addition of binary values. 

Th4 results of the above additions form the address 
of a branch destination instruction. m detail, the 
addition result for the lower 3 bits is set in the lower PC 
404 and the addition result for the upper 29 bits is set in 
the upper PC 403* 

The following explains the method used by the 
optimization apparatus 303, assembler 305 , and linker 307 
to calculate the PC relative value, which is to say, to 
subtract; the address of the branch destination instruction 
from the 1 address of the branch instruction. This 
subtraction is split into an upper 29 bits and lower 3 
bits, like the addition performed by the processor- The 
lower adfiress subtraction means 907 of the optimization 
apparatus 303, the lower address subtraction means 806 of 

the assembler 305, and the lower address subtraction means 

i 

706 of t;he linker 307 subtract the lower 3 bits of the 
address .of a branch instruction from the lower 3 bits of 
the address of the branch destination instruction. Pig. 
31B is a; subtraction table showing the subtraction rules 
for subtracting the lower 3 bits of the address of the 
branch instruction from the lower 3 bits of the address of 
the branch destination instruction in this calculation 
method that uses absolute values* As shown in the figure, 
the lower 3 bits of the branch destination address are set 
as the subtraction result for the lower 3 bits* 

The upper address subtraction means 910 of the 



optimization apparatus 303, the upper address subtraction 
means 809 of the assembler 305 , and the upper address 
subtraction means 709 of the linker 307 subtract the upper 
29 bits of the address of the branch instruction from the 
5 upper 29; bits of the address of the branch destination 
instruction. This is a normal subtraction of binary 
values. : 

The PC relative value is then found by setting the 
result of the above subtraction for the lower 3 bits as the 
10 lower 3 bits and the result of the above subtraction for 
the uppejr 29 bits as the upper 29 bits. 

i 

Fig. 32 shows the object code that is generated by 
the aboye address calculation method of the present 
embodiment that uses absolute values. The PC relative 

15 values of instructions 2606 and 2 613 differ to those in 

Fig. 26,; The following explains the calculation of the PC 
relative value of instruction 2606. 

Trie lower address subtraction means 706 subtracts the 
lower 3 bits "3 T b010" of the address of instruction 2406 

20 from the lower 3 bits "S'bOOO" of the address of 

instruction 24 01 in accordance with the subtraction table 
shown in Fig. 31B. This produces the lower subtraction 
result p'bOOO". 

The upper address subtraction means 709 subtracts the 

25 upper 29 bits "29 'hOOOOOOOl" of the address of instruction 
2406 from the upper 29 bits "29^00000000" of the address 
of instruction 2401. This produces the upper subtraction 



result "29 , hlfffffff"- 

The address difference calculating means 711 
generates the address difference "32 ' hlf f f f f f 8 n by setting 
the upper subtraction result "29* hlf f f f f f £ " as the upper 29 
bits and! the lower subtraction result "3'b000" as the lower 
3 bits. 

The relocation information resolving means 713 judges 
that the address difference "32 f hlf f f f f f 8" can be expressed 
by just the lower 13 bits "13' hlf f 8" and so replaces a 
label with this value "IS'hlffS" as a PC relative value to 
generate instruction 2606. 

The processor 309 executes the object code generated 
as described above. When executing a branch instruction, 
the processor 309 adds the upper PC 403 and lower PC 404, 

which have been amended to correctly indicate the branch 

i 

instruction, to the PC relative value in the branch 
instruction using the present absolute value method. 

When the processor 309 executes instruction 2606 in 
the object code shown in Fig. 32 , the lower PC calculator 
405 adds the amended lower PC 404 "3 T bOlO ,fc and the lower 3 
bits "3*bQQQ" of the PC relative value and updates the 
lower PC 404 to the resulting addition value "S'bOQO". The 
upper PC calculator 411 adds the amended upper PC 403 
"29^00000001" and the upper 29 bits "29'hlf f f f f f f " of the 
PC relative value and updates the lower PC 404 to the 
resulting addition value w 29'h00000000 rf . 

As described above, the present calculation method 
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can calculate addresses without needing to calculate the 
lower bits, so that the speed for calculating addresses can 
be improved. 

Sixth Embodiment 

The sixth embodiment of the present invention relates 
to a linear calculation method for addresses. Unlike the 
other embodiments, this linear calculation method 
calculates instruction addresses without splitting the 
calculation into an upper 29 bits and lower 3 bits. 

The following explains the present method for finding 
the address of a branch destination instruction from the 
address !of a branch instruction and a PC relative value. 
While th|e processor that uses the carry method is equipped 
with an ;upper PC calculator 411 for calculating the upper 
29 bits and a lower PC calculator 4 05 for calculating the 
lower 3 ^bits, a processor that uses the present linear 
calculation method is only equipped with one PC calculator 
for calculating a 32-bit address. The PC calculator in 
this linear calculation method adds a 32-bit address of a 
branch instruction and a 32-bit PC relative value. This 
calculation is a normal binary addition, 

Th^e addition result of the PC calculator is set as 
the address of the branch destination instruction* This 
means that the lower 3 bits of the addition result are set 
in the lower PC 404 and the upper 2 9 bits of the addition 
result are set in the upper PC 403. 
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The following explains the calculation of the PC 
relative value by the optimisation apparatus 303, assembler 
305, and linker 307, which is to say, the subtraction of 
the address of the branch instruction from the address of 

5 the branch destination instruction. Like the processor in 
this erobbdiment, the optimization apparatus 303, assembler 
305, and* linker 307 are each provided with only one 
calculator, the address subtraction means, for calculating 
a 32-bit address. The address subtraction means in this 

10 linear calculation method subtracts the address of a branch 
instruction from the address of a branch destination 
instruction. This calculation is a normal binary 
subtraction. The subtraction result is then set as the PC 
relative value. 

15 Fig, 33 shows the object code that has been generated 

using the linear calculation method of the present 
embodiment ♦ In Fig. 33, the PC relative values in 
instructions 2706 and 2713 differ to those shown in Fig. 
26. The following describes the method for calculating the 

20 PC relative value for instruction 2706. 

The address subtraction means in the linear 
calculation method subtracts the 32-bit address 
,, 32 , hOOdQOOOO" of instruction 2701 from the 32-bit address 
"32 'hOOGOOOOa" of instruction 2706 and so obtains the 

25 address difference "32'hfffffff 6 W . 

The relocation information resolving means 713 judges 
that the address difference "32 'hf f f ff f f 6" can be expressed 
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by just its lower 13 bits "l^'hlffe", and so replaces the 
label with n 13'hlff6 ?l as the PC relative value to generate 
instruction 2706. 

The processor 309 executes the object code generated 
as described above. When executing a branch instruction, 
the processor 309 adds the upper PC 403 and lower PC 404 
that have been amended to indicate the address o£ the 
branch instruction to the PC relative value using the 
present linear calculation method* 

When the processor 309 executes instruction 2706 in 
the object code shown in Fig* 33 , the PC calculator in this 
embodiment adds a 32-bit PC value "32 'hQOGQQQOa", which has 
the amended value of the upper PC 403 as the upper 29 bits 
and the amended value of the lower PC 404 as the lower 3 
bits, to the PC relative value "32 <hf f f f f f f 6" and so 
obtains the addition result "32 'hOOOOOQOO* . After this, 
the PC calculator updates the lower PC 404 to the lower 3 
bits ^bOOO" of this addition value, and the upper PC 403 
to the tipper 29 bits "29 1 hOOOOOOOO" of this addition value, 

Iri this way, the present linear calculation method 
can calculate addresses using a standard calculator as the 
PC calculator. This simplifies the structure of the 
processor. 

Seventh Embodiment 

The seventh embodiment of the present invention 
relates ;to a processor that interprets and executes PC 

i 
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adding instructions and PC subtracting instructions and to 
a compiler that generates such instructions. 

Fig. 34 shews the processor of the present 
embodiment. The processor of the present embodiment 
differs from the processor in the first embodiment in that 
it further includes a second lower PC calculator 2800 and a 
second upper PC calculator 2802 and in that the first 
instruction decoder 2801a/ the second instruction decoder 
2801b, and the third instruction decoder 2801c are all 
provided with new functions* 

The instruction decoders 2801a ~ 2801c are provided 
with an extra function for decoding PC adding instructions 
and PC subtracting instructions* Fig. 35A shows the 
operation that corresponds to a PC adding instruction which 
is shown in mnemonic form. As shown in Fig. 35A, a PC 
adding instruction adds a PC relative value "disp" to the 
value of the PC that is stored in a register and stores the 
addition result in the same register. Fig* 35B shows the 
operation that corresponds to a PC subtracting instruction 
which is shown in mnemonic form* As shown in Fig- 35B, a 
PC adding instruction subtracts a PC relative value Misp" 
from the value of the PC that is stored in a register and 
stores tjhe subtraction result in the same register. 

The second lower PC calculator 2800 and the second 
upper PC calculator 2802 perforin the PC adding instruction 

and PC subtraction instruction described above , using the 

i 

same calculation rules as the lower PC calculator 405 and 
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the upper FC calculator 411 described in the first 
embodiment • 

Fig. 36 shows the construction of the compiler of the 
present embodiment* 

The source code 2901 is a program written in a high- 
level language such as C* 

The intermediate code converting unit 2902 converts 
the source code 2901 into intermediate code 2903 which is 
an internal expression for the compiler. This intermediate 
code converting unit 2902 is a well-known technology and so 
will not be described. 

The PC value adding instruction converting unit 2904 
converts each intermediate code in the intermediate code 
2903 that adds a value of the PC and a variable into an 
assembler code 2906 for a PC adding instruction that is 
shown in Fig- 34 . 

The instruction converting unit 2905 converts the 
other intermediate codes into assembler code 2906. This 

instruct ion converting unit 2505 is a. well-known technology 

and so will not be described. 

The following describes a specific example of the 
operation of the present compiler. Fig, 37 is a flowchart 
showing ; the operation of this compiler. 

First, the compiler receives an input of source code* 
Fig. 38 shows source code which is written in C language, 
in Fig. 38, the external functions gl, g2, g3, and g4 are 
declare^, and the function f is defined as a function that 
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receives the int-type variable "i". This function f 
includes code that substitutes the address of function gl 
into the pointer fp if the value of "i" is 1, substitutes 
the address of function g2 into the pointer fp if the value 
of "i H is 2, substitutes the address of function g3 into 
the pointer fp if the value of "i" is 3, substitutes the 
address of function g4 into the pointer fp if the value of 
n i" is 4 Y and finally calls the function indicated by the 
pointer fp (step 3600) . 

Next, the intermediate code converting unit 2902 
converts the source code into intermediate codes. When 
doing so, the intermediate code converting unit 2902 
coverts (a) a source code that substitutes a pointer to an 
external function into a pointer variable into (b) an 
intermediate code that adds the difference between the 
address pf the start of present function and the address of 
the start of the external function to a temporary variable 
that stores the address of the start of the present 
function, and substitutes the addition result into the 
pointer svariable. 

Fig* 39 shows the intermediate codes that have been 

generated from the source program shown in Fig* 38. The 

f 

intermediate code 3201 shown in Fig. 39 is an intermediate 
code that has the label f marking the start of the function 
and that substitutes the present value of the PC, which is 
to say, ;the first address of function f, into the temporary 
variable tmp* The intermediate code 3202 is intermediate 
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code that judges whether the value of variable i is not 
"l w . The intermediate code 3203 is an intermediate code 
that branches to the label L when the judgement by 
intermediate code 3203 is true, that is, variable i is not 
"1". The intermediate code 3204 is executed when variable 
i is n l", and adds a difference, obtained by subtracting a 
first address of function f from the first address of 
function gl, to the temporary variable tmp into which the 
first address of function f has been substituted/ and has 
the addition result substituted into the variable fp. The 
intermediate code 3205 is an intermediate code that 
branches to the label L* 

The intermediate code 3206 includes the label Ll, and 
is an ir^termediate code that judges whether variable i is 
not equal to "2". The intermediate code 3207 branches to 
label L2 when the judgement in intermediate code 3206 is 
true, which is to say, when variable i is not n 2". The 
intermediate code 3208 is executed when variable i is equal 

to "2 W , and is an intermediate code that adds a difference, 
obtained by subtracting a first address of function f from 
the firsjt address of function g2, to the temporary variable 
tmp into which the first address of function f has been 
substituted, and has the addition result substituted into 
the variable fp. The intermediate code 3209 is an 

intermediate code that branches to the label L * 

i 

The intermediate code 3210 includes the label L2, and 
is an intermediate code that judges whether variable i is 
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not equal to "3". The intermediate code 3211 branches to 
label L3 when the judgement in intermediate code 3210 is 
true, which is to say, when variable i is not "3". The 
intermediate code 3212 is executed when variable i is equal 
to "3", and is an intermediate code that adds a difference, 
obtained by subtracting a first address of function f from 
the first address of function g3, to the temporary variable 
tmp into which the first address of function f has been 
substituted/ and has the addition result substituted into 
the variable fp. The intermediate code 3213 is an 
intermediate code that branches to the label L. 

The intermediate code 3214 includes the label L4, and 
is an intermediate code that adds a difference, obtained by 
subtracting a first address of function f from the first 
address of function g4, to the temporary variable tmp into 
which tt}e first address of function f has been substituted, 
and has the addition result substituted into the variable 

fp* The intermediate code 3215 includes the label L and is 

i 

an intermediate code that calls the function indicated by 
the variable fp* 

As described above , the intermediate codes in Fig. 39 
do not simply substitute the absolute address of the 
function gl, g2, g3 or g4 into the variable fp, but instead 
add a difference between the first address of function f 
and the i first address of one of the functions gl, g2, g3 f 
and g4 to the first address of the function f and 
substitute the addition result into the variable fp (steps 



94 



S3601 ~ S3603) • 

Next, the PC value adding instruction converting unit 
2 904 converts the intermediate codes into assembler code* 
The PC value adding instruction converting unit 2904 
searches for intermediate codes that add the value of the 
PC to a PC relative value and converts such codes into 
assembler code that uses the second lower PC calculator 
2800 and the second upper PC calculator 2802* The 
instruction converting unit 2905 then converts the 
remaining intermediate codes into assembler code. 

The PC value adding instruction converting unit 2904 
ascertains that the operand tmp in intermediate code 3204 
in Fig. 39 has been set at the value of the PC by the 
intermediate code 3201 and that the operator n +" indicates 
an addition of the value of the PC and a PC relative value, 
and so converts intermediate code 3204 into the assembler 
code addpc that performs an addition using the second lower 
PC calculator 2800 and the second upper PC calculator 2802* 
In the dame way/ the PC value adding instruction converting 
unit 2904 converts intermediate codes 3208, 3212, and 3214 
into assembler codes addpc. The other intermediate codes 
in Fig. ,39 are converted into assembler codes by the 
instruction converting unit 2905. 

Fig* 40 shows the assembler code that has been 
produced by converting the intermediate codes shown in Fig. 
39* In j Fig* 40, the assembler code 3301 has the label f 
marking the start of a function and is an instruction that 

r 
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transfers the value of the PC into register rl. The 
assembler code 2802 is an instruction that judges whether 
the constant w l w and the value of register rO are not 
equal. ,The assembler code 3303 is an instruction that 
branches to label LI when the judgement in assembler code 
2802 is true. The assembler code 3304 has the second lower 
PC calculator 2800 and the second upper PC calculator 2802 
add the PC relative value that is the difference between 
the first address of function gl and the first address of 
function f to the value of the PC which is the first 
address of function f and is stored in the register rl, and 
has the result transferred into register rl. The assembler 
code 3305 is an instruction that branches to the label L. 

The assembler code 3306 has the label Ll and is an 
instruction that judges whether the constant "2" and the 
value of register r0 are not equal- The assembler code 
3307 is >an instruction that branches to label L2 when the 
judgement in assembler code 3306 is true. The assembler 

code 3308 has the second lower- PC calculator 28 00 and the 

i 

second i^pper PC calculator 2802 add the PC relative value 
that is the difference between the first address of 
function g2 and the first address of function f to the 
value of the PC which is the first address of function f 
and is stored in the register rl, and has the result 
transferred into register rl. The assembler code 3309 is 

i 

an instruction that branches to the label L. 

The assembler code 3310 has the label L2 and is an 
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instruction that judges whether the constant "3" and the 
value of register rO are not equal- The assembler code 
3311 is an instruction that branches to label L3 when the 
judgement in assembler code 3310 is true. The assembler 
5 code 3311 has the second lower PC calculator 2800 and the 
second upper PC calculator 2802 add the PC relative value 
that is the difference between the first address of 
function g3 and the first address of function f to the 
value of the PC which is the first address of function f 
q 10 and is stored in the register rl, and has the result 
E transferred into register rl. The assembler code 3313 is 

O an instruction that branches to the label L* 

III Thp assembler code 3314 has the label L3 and is an 

2 instruction that has the second lower PC calculator 2800 

L 15 and the !second upper PC calculator 2302 add the PC relative 

<C value that is the difference between the first address of 

ill 4 

|5g function g4 and the first address of function f to the 

S value o^ the PC which is the first address of function f 

and is stored in the register rl, and has the result 

20 transferred into register rl. The assembler code 3315 has 
the label L and is an instruction that calls the function 
indicated by register rl. The assembler code 3316 is an 
instruction that ends the function* 

As : described above, when there is a source code in 

25 function f that substitutes a pointer to the external 

function g into a pointer variable, the present compiler 
does not: generate an instruction (such as "mov rl,g") that 
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transfers the address of the external function g into 
register; rl, but instead generates an instruction (addpc g- 
f, rl) that has adds a difference {g-f } in addresses 
between function f and function g to the address of 
function f that is stored in register rl, and has the 
result transferred into register rl* Since the value of 

! 

the PC relative value g-f is smaller that the absolute 
address g, the overall code size of programs can be reduced 
by using such addpc instructions. This has a further 
benefit for PIC codes where the addresses of a program in 
memory are determined when the program is executed, since 
calculation instructions that use such PC relative values 
must be used. 

In the same way as in the first embodiment , the 
assembler code produced by the compiler of the present 
embodiment is converted into object code that can be 
execute^ by the processor by an optimization apparatus 303, 
an assembler 305 and a linker 307* The processor executes 
the PC adding instruction "addpc g— f,rl n in the generated 
object code using the second lower PC calculator 2800 and 
the second upper PC calculator 2802, In detail, the second 
lower PC calculator 2800 adds the lower 3 bits of the 
constant! "g-f" and the lower 3 bits of the value stored in 
register; rl and sends any carry that is generated to the 
second upper PC calculator 2802* The second upper PC 
calculator 2802 adds the upper 29 bits of the constant "g- 
f", the upper 29 bits of the value stored in register rl, 
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and any carry it has received from the second lower PC 
calculator 2800. A value given by setting the addition 
result of the second lower PC calculator 2600 as the lower 
3 bits ahd the addition result of the second upper PC 
calculator 2802 as the upper 29 bits is then set in 
register rl * 

Note that while the instructions shown in Pig. 35A 
and 35B respectively are an addition and a subtraction of a 
constant and the value in a register, this need not be the 
case. An addition and a subtraction of values in 
registers, or an addition and a subtraction of a value in a 
register and the PC may equally be used. 

The calculation method used by the second lower PC 
calculator 2800 and the second upper PC calculator 2802 
also need not be the carry method used in the first 
embodiment. Provided the same method is used by the 
optimization apparatus 303, assembler 305, and linker 307 
that gerierate the object code to be executed by the 
processor, any of a no-carry method, a linear method, and 
an absolute value method may be used. 

Eighth Embodiment 

The eighth embodiment of the present invention 
relates to a debugger and a disassembler _ 

Fig, 41 is a block diagram showing the construction 
of the debugger and disassembler of the present embodiment. 

The input control unit 4000 receives an input from 
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the user and controls the other components according to 
this input. 

The packet address specifying unit 4001 calculates 
the uppep 29 bits of the address of the inputted 
instruction. 

The in-packet address specifying unit 4002 calculates 
the lower 3 bits of the address of the inputted 
instruction. 

The instruction memory 4 004 stores the instructions 
to be processed by the debugger and disassembler. As in 
the first embodiment, the addresses of instructions are 32 
bits in length and are composed of a packet address as the 
upper 29i bits and an in-packet address as the lower 3 bits. 
Fig. 41 phows how the instructions shown in Fig. 25 are 
stored. . 

The instruction reading unit 4003 reads an 

r 

instruction packet indicated by the packet address 
specifiejd by the packet address specifying unit 4001 from 
the instruction memory 4004. 

The instruction buffer 4005 stores the instruction 
packet rjead from the instruction memory 4004 by the 
instruction reading unit 4 003. 

Thp instruction decoding unit 4006 extracts the 
instruction unit with the in-packet address specified by 
the in-packet address specifying unit 4002 from the 
instruction buffer 4005 and decodes the extracted 
instruction unit, when the instruction unit is a branch 

i 
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instruction, the instruction decoding unit 4006 sends the 
PC relative value 4007 to the lower PC calculator 4008 and 
the upper PC calculator 4009. 

The label table 4011 is a table storing each label 
name associated with a corresponding instruction address. 
This label table 4 011 is generated by extracting 
information from the optimized code when the assembler 
described in the first embodiment generates machine 
language codes . 

In. Fig. 41, the address "32 1 hQOOOOOOO" corresponds to 
the label f, the address "32 ? hG00QQQ08 n corresponds to the 
label Li, and the address "32 f hl2345680" corresponds to the 
label L2. 

The display unit 4012 displays the results of a 
disassembling of an instruction. 

The instruction replacing unit 4013 writes the 
instruction that has been replaced into the instruction 
unit(s) :in the instruction buffer 4005 that is/are 
indicated by the in-packet address specified by the in- 
packet address specifying unit 4002, 

The instruction writing unit 4014 rewrites the 
instruction packet in the instruction memory 4004 with the 
packet address specified by the packet address specifying 
unit 4001 using the amended instruction packet stored in 
the instruction buffer 4005. 

The upper PC calculator 4 009 performs a calculation 
on the upper 29 bits of the instruction address specified 
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by the packet address specifying unit 4001 and the upper 29 
bits of the PC relative value 4007. 

The lower PC calculator 4 00S performs a calculation 
on the lower 3 bits of the instruction address specified by 
the in-pjacket address specifying unit 4002 and the lower 3 
bits of the PC relative value 4007. The calculation 
methods used by these PC calculators is the same as that 
used when generating the object code. 

The following describes a specific example of the 
operation of the present disassembler. Pig. 42 is a 
flowchart showing the operating procedure of this 
disassembler . 

First, the input control unit 4000 receives a command 
indicating the disassembling of an instruction and an input 
of the address of the instruction to be disassembled. In 
this specific example, the input control unit 4 000 receives 
"32 f h0000001a w as the instruction address (step S4100) ■ 

Next, the packet address specifying unit 4001 
specifics the packet address from the upper 29 bits of the 
instruction address* The instruction reading unit 4 003 
then re^ds the instruction packet with the specified packet 
address .from the instruction memory 4 004 and stores it in 
the instruction buffer 4 005. In this example, 
"29 l hOOQ00003" is specified as the packet address, and the 
instruction sequence "Id (r2) / r0||bra 13 f hlf ec I | add r2,r3" 
is stored in the instruction buffer 4 005 (step 54101) , 

Thje in-packet address specifying unit 4002 then 
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specifies the in-packet address from the lower 3 bits of 
the instruction address and informs the instruction 
decoding unit 4006 of the instruction unit that has the 
specified in-packet address. The instruction decoding unit 
4006 then extracts the indicated instruction unit from the 
instruction buffer 4005. In this example, "3^010" is 
specified as the in-packet address and the instruction "bra 
O'hlfec" that is the second unit in the instruction buffer 
4 005 is inputted into the instruction decoding unit 4006 
(step S4102} * 

The instruction decoding unit 4006 judges whether the 
inputted instruction is a branch instruction* In this 
example,; the inputted instruction "bra I3 f hlfec" is a 
branch instruction, so that this judgement is true (step 
S4103) . s 

When the instruction is a branch instruction, a 
calculation is performed on the PC relative value 4007 
indicated in the instruction and address of the inputted 
instruction. The lower PC calculator 4008 performs an 
additiori or a subtraction on the in-packet address of the 
inputted instruction and on the lower 3 bits of the PC 
relative value 4007 and sends the calculation result to the 
label search unit 4010. The upper PC calculator 4009 
perform^ an addition or a subtraction on the packet address 
of the inputted instruction and on the upper 29 bits of the 
PC relative value 4 007 and sends the calculation result to 
the lab<*l search unit 4010. The label search unit 4010 
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specifies the address of a label from the calculation 
result for the upper bits and the calculation result for 
the lower bits. In this example, the label address 
"32'h00000008" is specified by a calculation using the 
address "32 'hOOOOOOla" of the inputted instruction and the 
PC relative value 4007 "13'hlfec" (steps S4103, S4104) . 

The label search unit 4010 then refers to the label 
table 4011 and finds the label name that has the specified 
address. In this example, the label Ll corresponds to the 
address "32 ' hOOOOOOOS" (Step S4107) . 

The display unit 45l2 displays the assembler name of 
the branch instruction and the label name found by the 

label search unit 4010. In this example, the display unit 

I 

4012 displays the assembler name "bra" of the branch 

i 

instruction and the corresponding label name "Label Ll" 
(Step S4108) . 

The instruction decoding unit 4006 has the display 
unit 40l]2 display only the assembler name when the 
extracted instruction is not an assembler instruction (Step 

S4109) . ; 

Thp following describes a specific example of the 
operation of the present debugger. 

Fijg. 43 is a flowchart showing the operation of the 
present debugger. 

First, the input control unit 4000 receives a command 
indicating the debugging of an instruction, the address of 
an instruction to be replaced, and the instruction to be 
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used to replace of this instruction. In this specific 
example, the input control unit 4000 receives 
M 32 , h0000001a" as the instruction address and the 
subtraction instruction "sub r0,xl w as the replacement 
instruction (step S4200) - 

Next, the packet address specifying unit 4001 
specifies the packet address from the upper 29 bits of the 
instruction address. The instruction reading unit 4003 
then reads the instruction packet with the specified packet 
address from the instruction memory 4004 and stores it in 
the instruction buffer 4005. In this example, 
"29'h00000003" is specified as the packet address, and the 
instruction sequence "Id (r2) / r0||bra 13 'hlf ec | | add r2,r3* 
is stored in the instruction buffer 4005 (step S4201) . 

The in-packet address specifying unit 4 002 then 
specifies the in-packet address from the lower 3 bits of 
the instruction address. In this example, the in-packet 
address ,"3^010" is specified (step S4202) * 

If, the specified in-packet address is "S'bOOO* 1 , the 
first unit in the instruction packet in the instruction 
buffer 4005 is replaced with the inputted replacement 
instruction. If the specified in-packet address is 
"S'bOlO", the second unit in the instruction packet in the 
instruction buffer 4005 is replaced with the inputted 
replacement instruction. If the specified in-packet 
address is "3 , bl00 w , the third unit in the instruction 
packet in the instruction buffer 4005 is replaced with the 

s 
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inputted replacement instruction, in this example , the 
specified in-packet address is "S'bGlO", so that the 
instruction "bra 13'hlfec" in the second unit in the 
instruction packet in the instruction buffer 4005 is 
replaced with the inputted replacement instruction "sub 
r0,rl". As a result, the instruction packet in the 
instruction buffer 4005 becomes *ld (r2) / r0|lsub r0,rllladd 
r2,r3" {.steps S4203 - S4207} , 

The instruction writing unit 4014 replaces the 
instruction packet at the indicated packet address in the 
instruction memory 4004 with the instruction packet stored 
in the instruction buffer 4005. In this example, the 
instruction packet "Id (r2),rQIIbra 13 * hlf ec | ! add r2,r3" at 
the packet address "29^00000003" in the instruction memory 
4 004 is replaced with the instruction packet *ld 
(r2) , rO l; | sub r0,rl|jadd r2,r3" in the instruction buffer 
4005. [ 

As< described above, the disassembler of the present 
embodiment: can disassemble instructions tnat are executable 
for the processor 309 of the first embodiment. When an 
instruction is disassembled, instead of just displaying the 
PC relative value, the disassembler has the upper PC 
calculator and lower PC calculator calculate the address at 
which the label is located, uses the address to search the 
label table, and so displays the appropriate label name, 

Tbje debugger of the present embodiment reads 
instructions from the memory in units of instruction 
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packets that are byte-aligned/ rewrites an instruction in 
the instruction buffer, and writes the instructions back 
into the memory in units of instruction packets. This 
method is suited to the debugging of instructions that are 

5 not byte-aligned. 

Note that the calculation methods used by the lower 
PC calculator and the upper PC calculator do not need to be 
the carry method described in the first embodiment, so that 
another method, such as a separation method, an absolute 

10 value mejthod, or a linear method, can be used. 

The compiler, optimization apparatus, assembler, 
linker, processor, disassembler, and debugger of the 
present invention have been explained by way of the first 
15 to eightjh embodiments of the present invention, though it 

should be obvious that the present invention is not limited 
to these* Two example modifications are given below. 

(1) In the first to sixth embodiments, the c*s$ emblem code 

20 302, tha optimized code 304, the relocatable codes 306, and 
the object code 308 may be stored in a mask ROM, a 
semiconductor memory such as flash memory, a magnetic 
storage ; medium such as a floppy disk or a hard disk, or an 
optical disc such as a CD-ROM or DVD. 

25 

(2) In the seventh embodiment, the assembler codes 2 906 may 
be stored in a mask ROM, a semiconductor memory such as 



flash memory, a magnetic storage medium such as a floppy 
disk or a hard disk, or an optical disc such as a CD-ROM or 
DVD. 

Although the present invention has been fully 
described by way of examples with reference to accompanying 
drawings!, it is to be noted that various changes and 
modifications will be apparent to those skilled in the art. 
Therefore, unless such changes and modifications depart 
from the scope of the present invention, they should be 
construed as being included therein. 

f 

i 
f 
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What is jclaimed is: 

1. A processor for reading instructions from a memory 
according to a program counter, the memory storing 
instructions in one-byte units, and for executing the read 
instructions f 

the program counter including a first program counter 
and a second program counter, 

thk first program counter indicating a storage 
position of a processing packet in the memory, the 
processing packet being composed of an integer number of 
the one-byte units / 

thfe second program counter indicating a position of 
processing target instruction in the processing packet, the 

processing target instruction being an operation to be 

i 

executed by the processor. 

i 

2* The processor of Claim 1, including a first program 
counter jupdating means and a second program counter 
updating means, 

the second program counter updating means 
incrementing a value of the second program counter in 
accordance with an amount of instructions that were 
executed in a preceding cycle and sending any carry 
generated in an incrementing to the first program counter 
updating means, and 

the first program counter updating means adding the 
carry received from the second program counter updating 
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means to. the value of the first program counter, 

i 

s 

3. The processor of Claim 2, further including: 

program counter relative value extracting means for 
extracting/ when an instruction being executed includes a 
program counter relative value that is based on an address 
of a fir.^t instruction executed in a present cycle, the 
program counter relative value; and 

calculating means for adding the program counter 

i 

relative; value to the value of the first program counter 
and the value of the second program counter, and setting an 
additionj result as the value of the first program counter 
and the value of the second program counter. 

4* The processor of Claim 3, 

whferein the calculating means includes a first 
calculating unit and a second calculating unit, 

the second calculating unit adding the value of the 

second program counter and lower bits of trie program 

counter Relative value, setting a result of an addition as 
the value of the second program counter, and sending any 
carry generated in the addition to the first calculating 
unit, J 

thfe first calculating unit adding the value of the 
first prjogram counter, upper bits of the program counter 
relative value, and any carry received from the second 
calculating unit, and setting a result of an addition as 

i 

i 

; no 



the valufe of the first program counter. 

j 

5. The processor of Claim 3, 

wherein the calculating means includes a first 
calculating unit and a second calculating unit, 

the second calculating unit adding the value of the 
second program counter and lower bits of the program 
counter relative value without generating a carry, and 
setting a result of an addition as the value of the second 
program icounter, 

the first calculating unit adding the value of the 
first prjogram counter and upper bits of the program counter 
relative! value, and setting a result of an addition as the 
value of the first program counter. 

6- The program counter of Claim 3, 

wherein the calculating means adds the value of the 
first program counter and upper bits of the program counter 
relative value, sets a result o£ an addition as the value 
of the first program counter, and sets lower bits of the 
program ; counter relative value as the value of the second 
program ! counter . 

t 

7. The processor of Claim 3, 

wherein the calculating means adds the program 
counter , relative value and a value whose upper bits are the 
value of the first program counter and lower bits are the 



value of! the second program counter , and sets upper bits of 
a result 1 of an addition as the value of the first program 
counter and lower bits of the result as the second program 
counter . 

l 

8. The processor of Claim 2 r further including: 

prbgram counter relative value extracting means for 
extracting, when an executed instruction includes a program 
counter relative value that ±3 based on an address of the 
executed instruction, the program counter relative value; 

prpgram counter amending means for amending the value 
of the ^irst program counter and the value of the second 
program counter to indicate an address of the executed 
instruction; and 

calculating means for adding the program counter 
relative value, the value of the first program counter, and 
the value of the second program counter, and setting a 
result of an addition as the value of the first program 
counter .and -the value of the second program counter* 

9. The processor of Claim 2, further including: 

prbgram counter relative value calculating 
instruction decoding means for decoding a program counter 

i 

relative* value calculating instruction that performs an 
addition using a program counter relative value and one of 

(a) a value of the program counter stored in a 
register, and 

T 

J 
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8 (b} the value of the first program counter and the 

i 

9 value of the second program counter; 

10 calculating means for performing the addition 

11 indicated fay the program counter relative value calculating 

12 instruction to generate an addition result; and 

13 program counter value updating means for storing the 

14 addition result in one of 

15 {a$ the register, and 

16 (h) the first program counter and the second program 

17 counter.: 

1 10. The processor of Claim 1, 

2 wherein the first program counter indicates a memory 

3 address, the memory address being a storage position in the 

4 memory C|f a processing packet that is given by bit shifting 

5 the v^lue in the first program counter by log 3 n bits in a 

6 leftward direction, n being a length of a processing packet 

7 in bytes* 



11, Tho processor of Claim 10, further including 

an instruction buffer for temporarily storing 
instructions; and 

instruction reading means for transferring 
instructions with a minimum transfer size of one one-byte 
unit from the memory to the instruction buffer, in 
accordance with available space in the instruction buffer 
but regardless of a size of a processing packet. 

! US 
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1 12, An instruction sequence optimising apparatus, for 

2 generating optimized code from an instruction sequence, 

3 comprising: 

4 address assigning means for estimating a size of each 

5 instruction in the instruction sequence and assigning an 

6 address to each instruction, upper bits of each address 

7 indicating a memory address at which a processing packet is 

8 stored and lower bits of each address indicating a 

9 processing target instruction in the processing packet; 

10 label detecting means 

11 {(1) for detecting a label, which should be resolved 

12 by an address of a specified instruction, from the 

13 instruction sequence/ and obtaining the address of 

14 the specified instruction, and 

15 (2) for detecting a label, which should be resolved 

16 £y a difference in addresses of two specified 

1 7 instructions, from the instruction sequence, and 
1S bbtaining the addresses of the two specified 

19 instructions; 

20 program counter relative value calculating means for 

21 calculating, when a label which should be resolved by a 

22 difference in addresses of two specified instructions has 

23 been detected, a program counter relative value by 

24 subtracting an address of one of the two specified 

25 instructions from an address of another of the two 

26 specified instructions; 
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27 converting means 

28 (l) for converting an instruction that has a label 

29 that should be resolved by an address of a 

30 Specified instruction into an instruction with a 

31 £ise that is based on a size of the address of the 

32 specified instruction, 

33 |(2) for converting an instruction that has a label 

34 that should be resolved by a difference in 

36 addresses of two specified instructions into an 

O 36 instruction with a size that is based on a sise of 

;7l 37 the program counter relative value calculated from 

H 38 the addresses of the two specified instructions; 

III 39 and 

jc 40 optimized code generating means for generating 

41 optimized code by converting addresses of instructions in 
accordance with the sizes of instructions after conversion 



42 



43 by the Converting means* 



1 13 * Tin* instruction sequence optimising apparatus of Claim 

2 12, j 

3 whlerein the program counter relative value 

4 calculating means includes a lower bit subtracting unit and 

5 an uppezj bit subtracting unit, 

6 thje lower bit subtracting unit subtracting lower bits 

7 of the c^ddress of the one of the two specified instructions 

8 from lo^er bits of the address of the other of the two 

9 specified instructions, for setting a result of a 
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I 
f 

10 subtraction as lower bits of the program counter relative 

11 value, and sending any carry generated in the subtraction 

12 to the upper bit subtracting unit, and 

13 the upper bit subtracting unit subtracting upper bits 

14 of the address of one of the two specified instructions and 

15 any carry received from the lower bit subtracting unit from 

16 upper bits of the address of the other of the two specified 

17 instructions/ and for setting a result of a subtraction as 

18 upper bits of the program counter relative value* 

ji! 1 14, The! instruction sequence optimizing apparatus of Claim 

2 12, : 

y 3 wherein the program counter relative value 

: p 4 calculating means includes a lower bit subtracting unit and 

rf 5 an upper bit subtracting unit, 

|V 6 thje lower bit subtracting unit subtracting lower bits 

0 7 of the aiddress of one of the two specified instructions 

:Q 8 from lower bits of the address of the other of the two 

9 specified instructions without generating a carry and 

10 setting ja result of a subtraction as lower bits of the 

11 program (counter relative value, and 

12 the upper bit subtracting unit subtracting upper bits 

13 of the address of one of the two specified instructions 

14 from upper bits of the address of the other of the two 

15 specified instructions, and for setting a result of a 

16 subtraction as upper bits of the program counter relative 

17 value . \ 

! 
1 
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1 15. The- instruction sequence optimizing apparatus of Claim 

2 12, ; 

3 wherein the program counter relative value 

4 calculating means subtracts upper bits of an address of one 

5 of the two specified instructions from upper bits of an 

6 address pf the other of the two specified instructions , 

7 sets a rUsult of a subtraction as upper bits of the program 

8 counter relative value r and sets lower bits of the other of 

9 the two specified instructions as lower bits of the program 
10 counter irelative value. 

I 

1 16. An assembler that generates relocatable code from an 

2 instruction sequence, each address of an instruction in the 

3 instruction sequence having upper bits that indicate a 

4 memory atddress at which a processing packet is stored and 

5 lower bits that indicate a position of processing target 

6 instruction that is included in the processing packet, 

7 the assembler comprising; 

8 label detecting means for detecting a label in the 

9 instruction sequence that should be resolved by a 

10 difference in addresses between two specified instructions, 

11 and obtaining the addresses of the two specified 

12 instructions; 

13 program counter relative value calculating means for 

14 calculating a program counter relative value by subtracting 

15 an address of one of the two specified instructions from an 

i 
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16 address jof another of the two specified instructions; and 

17 replacing means for replacing the label with the 

18 program icounter relative value calculated by the program 

19 counter xelative value calculating means. 

1 17. The assembler of Claim 16, 

2 wherein the program counter relative value 

3 calculating means includes a lower bit subtracting unit and 

4 an upper bit subtracting unit / 

5 the lower bit subtracting unit subtracting lower bits 

6 of the address of the one of the two specified instructions 

7 from lower bits of the address of the other of the two 

8 specified instructions, for setting a result of a 

9 subtraction as lower bits of the program counter relative 

10 value, and sending any carry generated in the subtraction 

11 to the upper bit subtracting unit, and 

12 the upper bit subtracting unit subtracting upper bits 

13 of the address of one of the two specified instructions and 

14 any carr^y received from the lower bit subtracting unit from 

15 upper bits of the address of the other of the two specified 

16 instructions, and for setting a result of a subtraction as 

17 upper bits of the program counter relative value* 

S 

1 18. The [assembler of Claim 16, 

2 wherein the program counter relative value 

3 calculating means includes a lower bit subtracting unit and 

4 an upper bit subtracting unit, 
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5 thte lower bit subtracting unit subtracting lower bits 

6 of the address of one of the two specified instructions 

7 from lower bits of the address of the other of the two 

8 specified instructions without generating a carry and 

9 setting 'a result of a subtraction as lower bits of the 

10 program counter relative value, and 

11 the upper bit subtracting unit subtracting upper bits 

12 of the a t ddress of one of the two specified instructions 

13 from upper bits of the address of the other of the two 

14 specified instructions, and for setting a result of a 

15 subtraction as upper bits of the program counter relative 

i 

16 value . , 

t 

1 19. The assembler of Claim 16/ 

2 wherein the program counter relative value 

3 calculating means subtracts upper bits of an address of one 

4 of the two specified instructions from upper bits of an 

5 address bf the other of the two specified instructions, 

t 

6 sets a result of a subtraction as upper bits of the program 

7 counter relative value / and sets lower bits of the other of 

8 the two specified instructions as lower bits of the program 

9 counter relative value* 

! 

i 

1 20 • A linker that generates object code by combining 

2 relocatable code, each address of an instruction in the 

3 relocatable code having upper bits that indicate a memory 

4 address at which a processing packet is stored and lower 
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bits thajt indicate a position of processing target 
instruction that is included in the processing packet, 
the linker comprising; 

relocation information detecting means for detecting 
a label sin the relocatable code that should be resolved by 
a difference in addresses between two specified 
instructions, and obtaining the addresses of the two 
specified instructions; 

program counter relative value calculating means for 
calculating a program counter relative value by subtracting 
an address of one of the two specified instructions from an 
address jof another of the two specified instructions; and 

replacing means for replacing the label with the 

i 

program ^counter relative value calculated by the program 
counter .relative value calculating means. 

21. The linker of Claim 20, 

wherein the program counter relative value 
calculating means includes a lower bit subtracting unit and 
an upper bit subtracting unit, 

thp lower bit subtracting unit subtracting lower bits 
of the address of the one of the two specified instructions 
from lowpr bits of the address of the other of the two 
specified instructions/ for setting a result of a 
subtraction as lower bits of the program counter relative 

i 

value, and sending any carry generated in the subtraction 
to the upper bit subtracting unit, and 

E 
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the upper bit subtracting unit subtracting upper bits 
of the address of one of the two specified instructions and 
any carry received from the lower bit subtracting unit from 
upper bits of the address of the other of the two specified 
instructions , and for setting a result of a subtraction as 
upper bits of the program counter relative value. 

22. The jlinker of Claim 20, 

wherein the program counter relative value 
calculating means includes a lower bit subtracting unit and 

s 

an upper bit subtracting unit, 

this lower bit subtracting unit subtracting lower bits 
of the address of one of the two specified instructions 
from lower bits of the address of the other of the two 
specified instructions without generating a carry and 
setting a result of a subtraction as lower bits of the 
program icounter relative value f and 

thjS upper bit subtracting unit subtracting upper bits 
of the redress of one of the two specified instructions 
from upper bits of the address of the other of the two 
specified instructions, and for setting a result of a 
subtraction as upper bits of the program counter relative 
value ♦ ; 

23. The llinker of Claim 20/ 

wherein the program counter relative value 
calculating means subtracts upper bits of an address of one 
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4 of the t|wo specified instructions from upper bits of an 

5 address 'of the other of the two specified instructions, 

6 sets a result of a subtraction as upper bits of the program 

7 counter relative value, and sets lower bits of the other of 

8 the two specified instructions as lower bits of the program 

9 counter relative value. 

1 24 * A disassembler that receives an indication of an 

2 address of an instruction in object code and outputs an 

3 assembler name of the instruction at the indicated address/ 

4 each address of an instruction in the object code having 

5 upper bits that indicate a memory address at which a 

6 processing packet is stored and lower bits that indicate a 

7 position of processing target instruction that is included 

8 in the processing packet, 

9 the disassembler comprising: 

10 program counter relative value extracting means for 

11 extracting, when the indicated instruction includes a 

12 program counter relative value, the program counter 

13 relative value from the indicated instruction; 

14 label addressing calculating means for adding an 

15 address of the indicated instruction to the extracted 

t 

16 program {Counter relative value and setting an addition 

S 

17 result a label address ; 

18 storing means for storing a label name corresponding 

19 to eacri label address; and 

t 

! 

20 searching means for searching the storing means for a 

i 
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label naine that corresponds to the calculated label address 
and outpjutting the corresponding label name. 



1 25. The disassembler of Claim 24, 

2 wherein the label address calculating means includes 

3 a lower bit calculating unit and an upper bit calculating 

4 unit, 

5 the lower bit calculating unit for adding lower bits 

6 of the address of the indicated instruction and lower bits 

7 of the program counter relative value, setting a result of 

8 an addition as lower bits of a label address, and sending 

9 any carry generated by the addition to the upper bit 

10 calculating unit/ and 

11 thje upper bit calculating unit adding upper bits of 

12 the address of the indicated instruction, upper bits of the 

13 program ;counter relative value, and any carry received from 

14 the low$r bit calculating unit, and setting a result of the 

15 an addition as upper bits of the label address. 

1 26. The ^disassembler of Claim 24, 

t 

2 wherein the label address calculating means includes 

3 a lower Jbit calculating unit and an upper bit calculating 

4 unit, 

5 the lower bit calculating unit adding lower bits of 

6 the address of the indicated instruction and lower bits of 

7 the program counter relative value without generating a 

i 

8 carry, sjnd setting a result of an addition as lower bits of 
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9 a label (address, and 

10 thje upper bit calculating unit adding upper bits of 

11 the address of the indicated instruction and upper bits of 

12 the program counter relative value, and setting a result of 

13 an addition as upper bits of the label address . 

1 27. The disassembler of Claim 24, wherein 

2 this label address calculating means adds upper bits 

3 of the address of the indicated instruction and upper bits 

i 

4 of the program counter relative value, sets a result of an 

5 addition as upper bits of the label address, and sets lower 

6 bits of ithe program counter relative value as lower bits of 

7 the labdl address. 

5 
i 
\ 

1 28 * A debugger that receives an indication of an address of 

2 an instruction in object code and replaces the instruction 

3 at the indicated address with a replacement instruction, 

4 each address of an instruction in the object code having 

5 upper bits that indicate a memory address at which a 

6 processing packet is stored and lower bits that indicate a 

7 positio^ of processing target instruction that is included 

8 in the processing packet, 

9 th£ debugger comprising: 

10 processing packet reading means for reading a 

n processing packet that is indicated by upper bits of the 

12 indicated address from the memory and writing the 

13 processing packet into an instruction buffer; 
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14 instruction writing means for writing the replacement 

15 instruction into the processing packet in the instruction 

i 

16 buffer oyer an instruction that is indicated by the lower 

17 bits of Jthe indicated address; and 

18 processing packet writing means for writing the 

19 processing packet in the instruction buffer back into the 

20 memory after the replacement instruction has been written. 

1 29. A compiler that generates an instruction sequence from 

2 source code, 

3 thte compiler generating a program counter relative 

4 value calculating instruction that is executed by a 

i 

5 processcp, the program counter relative value calculating 

6 instruction being an instruction that performs a 

7 calculation using a first value and a program counter 

8 relative; value and uses a result of the calculation to 

i 

9 update t,he first value, the first value being one of 

> 

10 a value of a program counter stored in a 

11 register, and 

12 (b) the value stored in a program counter of the 

13 processor, 

14 whjerein upper bits of the first value indicate a 

15 memory address at which a processing packet is stored, and 
lower bits of the first value of the program counter 

indicate a processing target instruction that is included 

i 

in the processing packet. 
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1 30. The bompiler of Claim 29, 

i 

2 wherein the processor includes a lower bit 

3 calculating unit and an upper bit calculating unit, 

4 thp program counter relative value calculating 

5 instruction having the lower bit calculating unit perform a 

6 lower bi ; t calculation and the upper bit calculating unit 

7 perform an upper bit calculation , 

8 thp lower bit calculation being an addition using 

9 lower bits of the first value and lower bits of the value 

10 of the program counter relative value, where a result of 

11 the lowejr bit calculation is set as the lower bits of the 

12 first value and any generated carry is sent to the upper 

13 bit calculating unit, and 

14 the upper bit calculation being an addition using 

15 upper bits of the first value, upper bits of the value of 

16 the program counter relative value and any carry received 

17 from the! lower bit calculating unit, where a result of the 

18 upper bit calculation is set as the upper bits of the first 

19 value . 

! 
1 

1 31. The compiler of Claim 29, 

2 wherein the processor includes a lower bit 

3 calculating unit and an upper bit calculating unit, 

I 

4 the program counter relative value calculating 

5 instruction having the lower bit calculating unit perform a 

6 lower bit calculation and the upper bit calculating unit 

7 perform -an upper bit calculation, 
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thf lower bit calculation being an addition using 
lower bits of the first value and lower bits of the value 
of the program counter relative value that does not 
generate' a carry, where a result of the lower bit 
calculation is set as the lower bits of the first value, 
and • 

the upper bit calculation being a calculation using 
upper bits of the first value and upper bits of the value 
of the program counter relative value/ where a result of 
the upper bit calculation is set as the upper bits of the 
first value. 

! 
I 

32, The jcompiler of Claim 29, 

wherein the processor includes an upper bit 

calculating unit, 

f 

the program counter relative value calculating 
instruction having the upper bit calculating unit perform 
an uppe^ bit calculation and setting lower bits of the 
program icountet relative value as lower bits of the first 
value, and 

the upper bit calculation being an addition using 
upper bijts of the first value and upper bits of the value 
of the program counter relative value, where a result of 
the upper bit calculation is set as the upper bits of the 

first vajlue. 

■> 

) 

33. A computer-readable recording medium storing an 
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2 instruction sequence optimizing program that generates 

i 
J 

3 optimized code from an instruction sequence, the 

4 instruction sequence optimizing program including: 

5 art address assigning step for estimating a size of 

6 each instruction in the instruction sequence and assigning 

) 

7 an addrdss to each instruction,, upper bits of each address 

8 indicating a memory address at which a processing packet is 

9 stored a|nd lower bits of each address indicating a 

tO processing target instruction in the processing packet; 
Q 11 a fLabel detecting step (1) for detecting a label, 

;T5 12 which should be resolved by an address of a specified 

i*f. 13 instruction, from the instruction sequence, and obtaining 

W 14 the addzjess of the specified instruction, and 
: fi 15 ](2) for detecting a label, which should be resolved 

16 by a difference in addresses of two specified 

17 instructions, from the instruction sequence, and 
W is pbtaining the addresses of the two specified 

.ft 19 instructions ; 

20 a program counter relative value calculating step for 

21 calculating, when a label which should be resolved by a 

22 difference in addresses of two specified instructions has 

23 been detected/ a prograxa counter relative value by 

i 

24 subtracting an address of one of the two specified 

i 

25 instructions from an address of another of the two 

26 specified instructions; 

27 a bonverting step 

28 j(l) for converting an instruction that has a label 
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29 that should be resolved by an address of a specified 

30 instruction into an instruction with a size that is based 

31 on a size of the address of the specified instruction, 

32 12) for converting an instruction that has a label 

33 £hat should be resolved by a difference in 

34 addresses of two specified instructions into an 

35 instruction with a size that is based on a size of 

36 the program counter relative value calculated from 

37 £he addresses of the two specified instructions; 

38 fmd 

39 an 1 optimized code generating step for generating 

40 optimized code by converting addresses of instructions in 

i 

41 accordance with the sizes of instructions after conversion 

42 in the converting step. 

e 
t 

1 34. The jcomputer-readable recording medium of Claim 33, 

2 wherein the program counter relative value 

3 calculating step includes a lower bit subtracting substep 

4 and an Upper bit- subtracting sub step, 

i 

5 thje lower bit subtracting substep subtracting lower 

I 

6 bits of jthe address of the one of the two specified 

i 

7 instructions from lower bits of the address of the other of 

8 the two jspecified instructions, for setting a result of a 

9 subtraction as lower bits of the program counter relative 

i 

10 value, ^nd sending any carry generated in the subtraction 

11 to the qpper bit subtracting substep, and 

12 thje upper bit subtracting substep subtracting upper 

! 
i 
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i 

! 
i 

j 

\ 

13 bits of j:he address of one of the two specified 

14 instructions and any carry received from the lower bit 

15 subtracting substep from upper bits of the address of the 

16 other of ' the two specified instructions, and for setting a 

17 result of a subtraction as upper bits of the program 

t 

18 counter relative value. 

i 

1 35* The fcomputer-readable recording medium of Claim 33, 

2 wherein the program counter relative value 

! J 3 calculating step includes a lower bit subtracting substep 

W 4 and an upper bit subtracting substep, 

M> s th^ lower bit subtracting substep subtracting lower 

j<S 6 bits of jthe address of one of the two specified 

7 instructions from lower bits of the address of the other of 

D 8 the two jspecified instructions without generating a carry 

III 9 and settjing a result of a subtraction as lower bits of the 

'10 program jcounter relative value, and 
O 11 this upper bit subtracting substep subtracting upper 

12 bits of Jthe address of one of the two specified 

13 instructions from upper bits of the address of the other of 

14 the two jspecified instructions f and for setting a result of 

15 a subtraction as upper bits of the program counter relative 

16 value . ] 

t 

1 36. The [computer-readable recording medium of Claim 33 , 

! 

2 whjerein the program counter relative value 

3 calculating step subtracts upper bits of an address of one 
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4 of the t!wo specified instructions from upper bits of an 

5 address jof the other of the two specified instructions, 

6 sets a result of a subtraction as upper bits of the program 

7 counter Irelative value, and sets lower bits of the other of 

8 the two specified instructions as lower bits of the program 

9 counter relative value. 

1 37 * A computer-readable recording medium storing an 

2 assembler program that generates relocatable code from 

Q 3 optimized code that have been generated from an instruction 

*J 4 sequence, each address of an instruction in the optimized 

^ 5 code haying upper bits that indicate a memory address at 

lj 6 which a processing packet is stored and lower bits that 

p 7 indicate a position of processing target instruction that 

^ 8 is included in the processing packet, 
f: 9 thb assembler program comprising; 

M 10 a jlabel detecting step for detecting a label in the 

U instruction sequence that should be resolved by a 



15 
16 



19 
20 



12 difference in addresses between two specified ins Lructions, 

13 and obtaining the addresses of the two specified 

14 instructions ; 
a program counter relative value calculating step for 

calculating a program counter relative value by subtracting 

17 an address of one of the two specified instructions from an 

18 address lof another of the two specified instructions; and 
a replacing step for replacing the label with the 

program .counter relative value calculated by the program 
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counter relative value calculating step. 

i 

38. The computer-readable recording medium of Claim 37 , 

wherein the program counter relative value 
calculating step includes a lower bit subtracting substep 
and an upper bit subtracting substep, 

the lower bit subtracting substep subtracting lower 
bits of the address of the one of the two specified 
instructions from lower bits of the address of the other of 
the two ispecified instructions/ for setting a result of a 
subtraction as lower bits of the program counter relative 
value, and sending any carry generated in the subtraction 
to the dipper bit subtracting substep/ and 

thfe upper bit subtracting substep subtracting upper 
bits of |the address of one of the two specified 
instructions and any carry received from the lower bit 
subtracting substep from upper bits of the address of the 

other ofi the two specified instructions, and for setting a 

i 

result cjf a subtraction as upper bits o£ the program 
counter irelative value. 

39. The Icomputer-readable recording medium of Claim 37, 
whierein the program counter relative value 

calculating step includes a lower bit subtracting substep 

and an upper bit subtracting substep, 

thle lower bit subtracting substep subtracting lower 

bits of I the address of one of the two specified 

i 
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i 

\ 
1 

7 instructions from lower bits of the address of the other of 

8 the two specified instructions without generating a carry 

9 and setting a result of a subtraction as lower bits of the 
10 program counter relative value, and 

U the upper bit subtracting substep subtracting upper 

12 bits of the address of one of the two specified 

13 instructions from upper bits of the address of the other of 

14 the two specified instructions, and for setting a result of 

15 a subtraction as upper bits of the program counter relative 

16 value, 

i 

i 

1 40, The computer-readable recording medium of Claim 37 , 

2 wherein the program counter relative value 

3 calculating step subtracts upper bits of an address of one 

4 of the two specified instructions from upper bits of an 

i 

5 address !of the other of the two specified instructions, 

6 sets a result of a subtraction as upper bits of the program 

7 counter jrelative value, and sets lower bits of the other of 

8 the two -specified instructions as lower bits of the program 

9 counter [relative value. 

1 41. A computer-readable recording medium storing a linker 

2 program that generates object code from relocatable code 

3 that had been generated from an instruction sequence, each 

i 

4 address jof an instruction in the optimized code having 

5 upper bits that indicate a memory address at which a 

6 processing packet is stored and lower bits that indicate a 
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positioni of processing target instruction that is included 
in the processing packet, 

the linker program comprising; 

a relocation information detecting step for detecting 

a label in the relocatable code that should be resolved by 

i 

a difference in addresses between two specified 
instructions/ and obtaining the addresses of the two 
specified instructions; 

a program counter relative value calculating step for 
calculating a program counter relative value by subtracting 
an address of one of the two specified instructions from an 
address jof another of the two specified instructions; and 

a [replacing step for replacing the label with the 

5 

program .counter relative value calculated by the program 
counter relative value calculating step. 

42. The icomputer-readable recording medium of Claim 41, 

wherein the program counter relative value 
calculating step includes a lower bit subtracting substep 
and an upper bit subtracting substep, 

this lower bit subtracting substep subtracting lower 
bits of jthe address of the one of the two specified 
instructions from lower bits of the address of the other of 
the two (Specified instructions/ for setting a result of a 
subtraction as lower bits of the program counter relative 
value, ajnd sending any carry generated in the subtraction 
to the upper bit subtracting substep, and 

! 
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the upper bit subtracting substep subtracting upper 
bits of the address of one of the two specified 
instructions and any carry received from the lower bit 
subtracting substep from upper bits of the address of the 
other of 1 the two specified instructions, and for setting a 
result of a subtraction as upper bits of the program 
counter relative value* 



43* The pomputer-readable recording medium of Claim 41, 

wherein the program counter relative value 
calculating step includes a lower bit subtracting substep 
and an u^>per bit subtracting substep, 

thj* lower bit subtracting substep subtracting lower 

I 

bits of jthe address of one of the two specified 
instructions from lower bits of the address of the other of 
the two jspecified instructions without generating a carry 
and settling a result of a subtraction as lower bits of the 
program jcounter relative value , and 

thla upper bit subtracting substep subtracting upper 

! 

bits of ithe address of one of the two specified 
instructions from upper bits of the address of the other of 
the two jspecified instructions, and for setting a result of 
a subtraction as upper bits of the program counter relative 
value * ; 

i 

I 

44. The Icomputer-readable recording medium of Claim 41, 

i 

whjprein the program counter relative value 
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calculating step subtracts upper bit* of an address of one 
of the two specified instructions from upper bits of an 

address bf the other of the two specified instructions, 

i 

sets a result of a subtraction as upper bits of the program 
counter relative value , and sets lower bits of the other of 
the two specified instructions as lower bits of the program 
counter irelative value. 

I 

45. A computer-readable recording medium storing a compiler 
program that generates an instruction sequence from source 
code, ; 

th^ compiler program generating a program counter 

relative 1 value calculating instruction that is executed by 

! 

a processor, the program counter relative value calculating 

i 

instruction being an instruction that performs a 
calculation using a first value and a program counter 
relative; value and uses a result of the calculation to 
update tjhe first value, the first value being one of 

(a) a value of a program counter stored in a 
register, and 

(bj) the value stored in a program counter of the 
processor, 

wherein upper bits of the first value indicate a 
memory address at which a processing packet is stored, and 
lower bi^ts of the first value of the program counter 

indicate a processing target instruction that is included 

i 

in the processing packet. 
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46 , The icomputer-readable recording medium of Claim 45, 

whkrein the processor includes a lower bit 
calculating unit and an upper bit calculating unit, 

the program counter relative value calculating 
instruction having the lower bit calculating unit perform a 
lower bit calculation and the upper bit calculating unit 
perform ;an upper bit calculation, 

thp lower bit calculation being an addition using 
lower bi'Ls of the first value and lower bits of the value 
of the program counter relative value, where a result of 
the lower bit calculation is set as the lower bits of the 

first value and any generated carry is sent to the upper 

j 

bit calculating unit, and 

the upper bit calculation being an addition using 
upper bits of the first value, upper bits of the value of 
the program counter relative ^alue and any carry received 
from the; lower bit calculating unit, where a result of the 
upper bit calculation is set as the upper bits of the first 
value . 

i 

i 

47. The icomputer-readable recording medium of Claim 45, 

t 
i 

wherein the processor includes a lower bit 
calculating unit and an upper bit calculating unit, 

thbs program counter relative value calculating 
instruction having the lower bit calculating unit perform a 
lower bit calculation and the upper bit calculating unit 
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! 
} 

perform an upper bit: calculation, 

th£ lower bit. calculation being an addition using 
lower bijts of the first value and lower bits of the value 
of the program counter relative value that does not 
generate* a carry, where a result of the lower bit 
calculation is set as the lower bits of the first value, 
and 

th£ upper bit calculation being a calculation using 
upper bits of the first value and upper bits of the value 
of the program counter relative value, where a result of 
the upper bit calculation is set as the upper bits of the 
first vaiiue. 

48. The jComputer-readable recording medium of Claim 45, 

wherein the processor includes an upper bit 
calculating unit, 

thje program counter relative value calculating 
instruction having the upper bit calculating unit perform 
an upper bit calculation and setting lower bits of the 
program icounter relative value as lower bits of the first 
value, and 

the upper bit calculation being an addition using 
upper baits of the first value and upper bits of the value 
of the program counter relative value, where a result of 
the upper bit calculation is set as the upper bits of the 
first value. 

! 
i 
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i ABSTRACT OF THE DISCLOSURE 

When a branch instruction is decoded by the 

instruction decoders 409a - 409c, the upper 29 bits of the 

i 

PC relative value included in the branch instruction are 
sent to {the upper PC calculator 411 and the lower 3 bits 
are sent to the lower PC calculator 405. The lower PC 
calculator 405 adds the lower 3 bits of the PC relative 
value and the lower 3 bits of the present lower PC 404 and 
sends the result to the lower PC 404 as the updated lower 
PC. The, upper PC calculator 411 adds the upper 29 bits of 
the PC relative value, the upper 29 bits of the present 
upper PCj 4 03/ and a carry that may be received from the 
lower PC calculator 405, and sends the result to the upper 
PC 403 a;s the updated upper PC. 



139 



JOSEPH W. PRICE 
ALBINH GESS 
FRANKLIN D. UBELL 
DOYLE B JOHNSON 
MICHAEL J MOFFATT 
GORDON E GRAY III 
BRADLEY D BLANCHE 



Price, Gess & Ubell 

ATTORNEYS AT LAW 

2100 S.E. MAIN STREET, SUITE 250 



IRVINE, CALIFORNIA 92614-6238 



A PROFESSIONAL CORPORATION 
TELEPHONE (949)261-8433 
FACSIMILE- (949) 261-9072 
FACSIMILE (949) 261-1726 

e-mail: pgu@pgulaw.com 



DRAWINGS (40 SHEETS) 



Inventor(s): 



Shuichi Takayama et al. 



Title: 



A PROCESSOR FOR EXECUTING INSTRUCTIONS IN 
UNITS THAT ARE UNRELATED TO THE UNITS IN WHICH 
INSTRUCTIONS ARE READ, AND A COMPILER, AN 
OPTIMIZATION APPARATUS, AN ASSEMBLER, A LINKER, 
A DEBUGGER AND A DISASSEMBLER FOR SUCH PROCESSOR 



Attorney's 
Docket No. 



NAK1-BG86 



EXPRESS MAIL LABEL NO. EL180744630US 



DATE OF DEPOSIT: April 28, 1999 



o 

— 1 



o . 
crfg 



O 



CO 



' 








<» 
















g 


o 


O 








E 














e 


B 




• * » * 
















s 




fid 
H 








00 


CO 


5/5 




5 


Z 




5C 


II . 









CM 



04 

^5 






o 




o 








*t 










2 


m 

On 


Rd 










O 










O 




ex 
O 


<Ti 

i — i 


O 














o 




o 




o 














J 



CO 



C*5 



*2 



o 



3 

d 
E 



u 

d 
E 



d 
E 



2 
E 



FIG. 3 A 



21 BITS. 



21 BITS, 



21 BITS^ 1 BIT 



FIRST UNIT 



SECOND UNIT 



THIRD UNIT 



INSTRUCTION PACKET(64 BITS) 



FIG.3B 



FIR$T 
CYCLE 

SECiOND 
CYCLE 

THIRD 
CYCpLE 



UNIT 1 



PARALLEL EXECUTION BOUNDARY 

i 



UNIT 2 



t 



UNIT 3 



UNIT 4 



UNITS 



UNIT 7 



I 



UNIT 8 



UNIT 6 



UNIT 9 



E 
E 



FIGJ 3C 



FIRST 
CYCLE 



UNIT1 



SECOND 
CYCIE 



UNIT 3 



UNIT 2 



UNIT 4 



UNIT 5 



UNIT 6 



THIRD 
CYCLE 



UNIT 7 



FOURTH 
CYCLE 



UNIT 8 



UNIT 9 



j 



i 



\ 




COMPILER 




302 



optimizationV 
apparatus j 



303 




304 



ASSEMBLER \ 

1 ' 



305 




306 



307 



308 



309 



HG.6 




DATA 
MEMOjlY 



29 



PREFETCH 
LOWER 
COUNTER 



;? 3 



413 



INSTRUCTION 
MEMORY 

1 

407 



FIRST SECOND 



UNIT 



WNTT 



THIRD 
UNIT 



instruchon 

PACKET 



408 



INSTRUCTION 
BUFFER 



HG, 8A 



LOWER 3 BITS 
^QF ADDRESS 
LOW2R 3\.VALUE 

BITS OF PC 
RELATIVE VALUE 


3'bOOO 


3'bOlO 


3'bl00 


3'bOOO 


3'bOOO 


3'b010 


3'blOO 


3'b010 

I 

i 3'blOO 


3'bOlO 
3'blOO 


3'blOO 

3'bOOO 
(CARRY 1) 


3'bOOO 
(CARRY 1) 

3'bOlO 
(CARRY 1) 


HG J 8B 








; LOWER 3 BITS 
v , OF ADDRESS 
Nj VALUE(TOBE 

NSUBTRACTED) 








LOWER 3 BITSV 
OF ADDRESS \ 
VALUEl(BEFORE \ 
SUBTRACTION) \, 


3'bOO 


ObOlO 


Ob 100 


1 

\ 3'bOOO 

: 3^010 

3'bl00 


3'bOOO 
3*b010 
3'blOO 


3'blOO 
(CARRY 1) 

3'bOOO 
3'bOlO 


3'bOlO 
(CARRY 1) 

3'blOO 
(CARRY 1) 

3'bOOO 



FIG. 9 , 

assembler! 

CODE ~J 
CODE OPTIMIZATION APPARATUS] 



[opto 



OPTIMIZATION PROCESSING CODE 



903 



ADDRESS ASSIGNING MEANS Y 



904 




ADDRESS ASSIGNING CODE 



dej 



916 



LABEL DETECTING MEANS ] ^ 
* 906 



905 



HE 



iJABEL INFORMATION 




907 



•SLOWER ADDRESS 1 
nsUBTR ACTION MEANS | 

908^ - 1 



910 



UP^ER ADDRESS SUBTRACTION MEANS 



ADDRESS DIFFERENCE 
CALCULATING MEANS 



t 

Address m fference 




vctionI 
——J 

v 909 



LOWER 
SUBTRACTION 
RESULT 



>ttlABEL INFORMATION ^ 914 



^{RESOLVING MEANS 



FIG. 10 



( START ) 



INPUT OF ASSEMBLER CODiT 
1 



S9000 
S9001 



ASSEMBLER CODE QPTIMIZED| 

>k ^S9002 



PROVISIONAL ADDRESSES ASSIGNED I 

IT 1 



S9003 



LOCAL LABELS DETECTED^ 

i_ , _S9004 



LABEL INFORMATION GENERATED] 




S9006 



SUBTRACTION OF LOWER ADDRESSES | 

* T- -S9007 



SUBTRACTION OF UPPER ADDRESSES | 

V ^ S9008 



CALCULATION OF ADDRESS DIFFERENCE | 



-S9009 

CAN INSTRUCTION 
ADDRESS OR PC RELATIVE 
VALUE BE EXPRESSED 
rSING 13 BITS2- 



-S9010 



SET AS 42-BIT 
INSTRUCTION 1 



T 



JCes. 



SET AS 21-BIT Y~* 
INSTRUCTION I 



S9011 
S9012 



INSTRUCTION PACKETS GENERATED | 
1, 11 1 



C END ^ 



FIG. 11 



Ll: 


mov r2, rl 


• * 1000 






* - 1001 




add rO, r4 


• * 1002 




andrl, r3 


■ * 1003 




mov L2, r2 


• • 1004 




Id(r2) t r0 


• • 1005 




braLl 


■ • 1006 


* * • 


add rl, r3 


* • 1007 


L2: 




• • 1008 



FIG. 12 



32*h00000800 


Ll: 


mov r2, rl 


• • 1000 


32'h00000802 


isrf 


• * 1001 


32^00000804 


add rO, r4 


* • 1002 


32'h00000808 


andrl,r3 


■ ■ 1003 


32'h0000080a 


mov L2 } r2 


■ • 1004 


32'h00000810 


Id(i2).i0 


• • 1005 


32'h00000812 


braLl 


• • 1006 


32*h00000814 


add r2, r3 


♦ • 1007 




« • • 




32*hl2345678 


L2: •■« 


• • 1008 



FIG. 13 



INSTRUCTION 


RESOLVING VALUE 


mqv L2, r2 


ADDRESS 32Til2345678 


bn*Ll 


PC RELATIVE VALUE 321100000800-321100000812 



1 



FIG. 14 



Ll:i 

f 


mov x2, rl II 


jsrf || 


add rO, r4 


• • 1300 




and rl, r3 11 


rnovL2,r2 II 


(mov L2, r2) 


• • 1301 




Id(r2),r0 II 


bra LI II 


add r2, r3 


* • 1302 


ij 

! 








- • 1303 



l j j j x i,ji.*;u \j u * v < i. WlKi v ■ J 1/ y\MO Jx^vvj. j. > a a. _x *a^a 



FIG. 15 




802 



MACHINE LANGUAGE CODE GENERATING MEANS| 




MACHINE LANGUAGE CODES 




803 



LABEL DETECTING MEANS K 
^ 



804 



LABEL INFORMATION 




805 



806 



LOWER ADDRESS 

SUBTRACTION MEANS 
— wjr 



] 



^809 



UPPER ADDRESS 
SUBTR ACTION MEA NS 
W , - 



UPPER SUBTRACTION 
l&ESULT 



ADDRESS DIFFERENCE 
CALCULATING MEANS 




LOWER 
SUBTRACTION 
[RESULT 



808 




ADDRESS DIFFERENCE 
? 



:=5 L-'812 
CEJ 



LABEL INFORMATION^ 
RESOLVINq MEANS 

<ST 



RELOCATABLE 
ICODES 



5h 



814 



FIG. 16 



( S T ART ) 

V ^S15 00 

INPUT OF OPTIMIZED CODE \ 



± 



S1501 



MACHINE LANGUAGE CODES 
GENERATED AND ADDRESSED 
ASSIGNED 

j ^31502 



1 



DETECTION OF LOCAL LABELS TO 
BE RESOLVED BY PC RELATIVE VALUES 

^ ^S15Q3 

1 LABEL INFORMATION GENERATED \ 



-S1504 



SUBTRACTION OF LOWER ADDRESSES I 

j ^S1505 
I SUBTRACTION OF UPPER ADDRESSES 1 

CALCULATION OF ADDRESS DIFFERENCEl 



-S1507 



No 



\ > 


f ^S1508 


LABEL REPLACED WITH 
A 32-BIT VALUE 



CAN ADDRESS 
DIFFERENCE BE 
EXPRESSED USING^ 
13 BITS?^ 

Yes ^S150 9 

LABEL REPLACED WITH I 
LOWER 13-BIT VALUE j 



— H 



o 
-3- 



13 



o 



> 

o 
S 



o 



<2 



OS 



o 
o 
o 

s 

o 

o 

as 



o 



2 



s 



o 
o 



FIG. 18 



INSTRUCTION 


RESOLVING VALUE 1 


bra LI 


PC RELATIVE VALUE 32*h00000000-32 , h00000012 j 



8 S B 



On 

i 




FIG. 20 



RELOCATABLE! 
CODES 




CODE COMBINING MEANS p 702 



COMBINED COD ES 



J- 



703 



RELOCATION INFORMATION 
DETECTING MEANS 



I 



'704 



RELOCATION 
INFORMATION 



LOWER ADDRESS 
SUBTRA CTION MEANS I 



UPPER ADDRESS 
SUBTRACTION MEANS 




UPPER SUBTRACTION 
.RESULT 



ADDRESS DIFFERENCE 
CALCULATING MEANS 



I (ac 



ADDRESS DIFFERENCE 




706 



LOWER 
SUBTRACTION 
RESULT 

— r - 

708 



RELOCATION INFORMATION 



713 



FIG. 21 



( S TART ^ 

V " ^52000 

INPUT OF RELOCATABLE CODES | 



RELOCATABLE CODES COMBINED^ 

^ ^S2002 



RESOLVING OF ADDRESSES OF 
LABELS THAT ARE RESOLVED 
BY INSTRUCTION ADDRESSES 



1 



S2003 



DETECTION OF EXTERNAL LABELS 
THAT SHOULD BE RESOLVED BY 
PC RELATIVE VALUES 



-S2004 



BRANCH DESTINATION ADDRESSES 
INCORPORATED INTO BRANCH 
INSTRUCTIONS 



^ ^S2005 
SUBTRACTION OF LOWER ADDRESSES 1 



I ^S2006 

SUBTRACTION OF UPPER ADDRESSES^ 




1 
} 

t 

i > 


f ^S2009 


LABEL RE] 
32-BIT VAI 


PLACED WITH 1 
.UE | 



LABEL REPLACED WITH I 
13-BIT VALUE 




o ^ oo *-* v> 
O O O fmm - 
00 00 oo oo 



o o o o 



8 

00 



On 

g 



1 



s 

o 
o 



O 



o 
o 



I 

as 

CM 



S 



o 

' *» at 

o 



s 



O 

00 



<2 

i 



O 

"o 



o 
o 

OS 



o 



3 



S 

00 



<2 



(-4 



o 

o 



8 



ON 



r— 4 



o 
© 

j — ( 



o 



oo 



s 

T3 



© 



s 



o to |Q 

ON I J as 



is 



S 

ON 



9 2 
1 



© 

"o 

o 
os 



o 



S3 Lo 

CS f ^ 



> 



o 



3 



IS 



On CM 
O H 
C7s os 



.ha 



ha 



o 
©" 



o 

"©" 



O IO 

©To 



8 



o 
o 

2 

ON 

CM 



O 

o 
•a 

Os 
IN 



gio 



o J o 
o 
o 



! FIG. 25 



8 3 g ~ 

N M N 



O 



O 

r— < 



Hi 

si 



©_ 

s 



o 

• MM* 

o 
o 



o 
"o 



o 



S2 
© 

m 

iff 

§ 



© 



o 
L©L 



O 
CO 

.0 



<2 



<2 



s 

T3 



i 



"©" 



o 
o 
o 
o 

2 



"©' 
8 



o 
o 



© 

g 

© 
o 
o 

s 



O 



ON 
CM 



o 

o o 



O r-H 



r— < 



8 



SI 



o 



^3 

13 



CM 



■a 

-<=> 



s 



"o" 



o 
o 

22 



M3 



s 

8 



a 
cc 

cn 

s 

m 

i 



8 
3 



o 

cn 

CSI 



rn 
i — i 

J 

O 

■ m m m 

ri 



2> 



<2 

i 



a 



0 



o 

cf 



"o" 



§ 



i CD 



o 
$ 



GO 
CO 



Q 
P 
< 



»5 



§ 



8 

B3 
CO 



a 



o 

s 



2 

on 



O 

ft 



P 



GO 



CO 



O 
o 



to 



D 

S 

w 



oo 

© 
E 



I 



I 



CO 



g 



CO 

55 



55 
5 



CO 

s 

v6 



FIG. 



29A 



OWER 3 BITS OF 
ADDRESS 
VALUE 



LOWER, 3 
BITS OF PC 
RELATIVE VAL 



3*b000 



3 ! b010 



3'blOO 



; s'booo 


3'b000 


3'b010 


3 blOO 


ThOlO 


3'b010 


3'blOO 


3'b000 






(CARRY 








IGNORED) 


l j D1UU 




U\J\J\J 


3'bOlO 




(CARRY 


(CARRY 


l 




IGNORED) 


IGNORED) 


FIG. 29B 








1 LOWER 3 BITS 








V l OF ADDRESS 








\ VALUE (TO BB 








>^SUBTRACTED) 








LOWER 3 BrisV 








OF ADDRESS \ 
VALUE (BEFORE \ 


3*b000 


3'b010 


3'blOO 


BE SUBTRACTION) \ 








! 3'b000 


3'bOOO 


3'blOO 


3'bOlO 






(CARRY 


(CARRY 






IGNORED) 


IGNORED) 


\ 3'b010 


3'bOlO 


3'b000 


3'blOO 








(CARRY 








IGNORED) 


; 3i3ioo 


3'blOO 


3'bOlO 


3'b000 



» * • « • 
V » * * * 



o 


o 


O 


o 


■i 




-2403 












r 


• 

it 


i — i 

A 

* 


CO 

-a 

■s 








1+ m m * 

O 


S3 


. « <» — 

o 






* 

1 

o 
"o" 


g 

« 

jd 
to 

i 


8 

CO 
CA 

t— i 

J=3 
C^ 
CO 

i 


CO 

f-H 

* 

i 

* — i 
CO 

e 

» ^ 






-2401 


* 


On 
O 

• 
m 


m 




* 
* 


« 

■4— » 

<1> 


■ 

5 

e 


* 

1 


9 
S3 

TJ 








►J 






« 


• * 

-I 




_o 


o 
©" 


O 

o" 






29'hOOOOOOOO 


o 
§ 

On 
1« 


o 
o 

o 


CO 

o 
a> 




¥ 



FIG. 31 A 



SLOWER 3 BITS OF 
TV ADDRESS 
LOWER 3\VALUE 

BUS OF PC ^v 
RELATIVE VALUE V. 


3'bOOO 


3'b010 


3'blOO 


£ 3'bOOO 


3'bOOO 


3'bOOO 


3'bOOO 


!J i 3'bOlO 


3'bOlO 


3'bOlO 


3'bOlO 


P \ 3'blOO 

\ 


3'blOO 


3'blOO 


3'blOO 


1 FID. 3 IB 








1 

VJ^OWER 3 BITS OF 
>V ADDRESS 
| \^ VALUE 

LOWER 3 V. 

btt^ofpc \ 
relative value \ 


3'bOOO 


ObOlO 


Ob 100 


t 

j 3'bOOO 


3'bOOO 


3'bOOO 


3'bOOO 


3'b010 


3'b010 


3'bOlO 


3'bOlO 


j 

| 3'blOO 

1 


3'blOO 


31)100 


3'blOO 



g S § 2 £ 

\0 \0 \0 NO \D 
?5 N M (S 



© 

E 




o ^ i— * in 

o o o — *-i 

t> r*- 

CM CM CM CM 



CO 



m 
o 



B 1 

C3 



S 



s 

o 

O 



o 

o 



1 



Si 



IS 

CO 



CM 

s 



o 

o" 

1 — I 

8 
8 

O 
O 

CM 



<r4 



S3 
o 

oo 

NO 

CM 
CO 

>- 

§ 



s 

CM 



p. 

o" 



<2 



I — I 



CM 

1 1 

Is 



o 



CM 



FIG: 34 





general/ I 



401c 



REGISTERS! 
^ v 402 



UPPER 
PC 



\29 



LOWER! , 
PC I I1NC 




423 



OPERAND 
ADDRESS 
BUFFER 



I 



406 



DATA ! 
MEMORY 




420 




32 



> k 32 



\ PC RELATIVE 
VALUE SELECTO* 



410 



IMMEDIATE! 
SELECTOR I 



'0L 



msmucnoN 

DECODER 



PREFETCH UPPER 
COUNTER 



29\ 



.OL 




PREFETCH 

LOWER 

COUNTER 

T 



I 



413 



INSTRUCTION 
MEMORY I 

407 



-\ — > 

64 



DNTROL 




THl_ 

instructioi 

DECODER 



J 



801a ^ 21 ¥8Qlb^ )1c 









FIRST 
UNIT 


SECOND 
UNIT 


THIRD 
UNIT 




























INSTRUCTION 
BUFFER 



miction 

PACKET 



408 



MNEMONIC 
FIQ, 35A addpc disp , Rn 
FIG. 35B subpc disp , Rn 



OPERATION 
Rn + disp— > Rn 
Rn — disp— > Rn 



FIG. 36 




SOURCE CODE 



2901 



INTERMEDIATE CODE 
CONVERTING UNIT 



2902 




.2903 



> 




f ->><x\< \t __-2904 


INSTRUCTION L 
iCONVERUNG UNITi 

.1. 


PC VALUE ADDING 1 
INSTRUCTION I 
CONVERTING UNIT | 



ASSEMBLER 
CODE 



FIG. 37 



START 

.S3600 

I INPUT OF SOURCE CODE j 

IS THERE A 
SOURCE CODE THAT 
SHOWS A POINTER TO A^ 
FUNCTION? 

■ Yes _^S36Q2 



l¥1 



DETECTED SOURCE CODE CONVERTED 
INTO INTERMEDIATE CODE THAT ADDS 
THE PC VALUE AND A DIFFERENCE 
VAOJE FOR THE POINTER TO THE PC VALUE | 



S3603 



OTHER SOURCE CODE CONVERTED! 
INTO INTREMEDIATE CODE 1 



-S3604 



^ IS THERE AN ^ ^ 

Intermediate com that 

ADDS THE PC VALUE AND A 
^DIFFERENCE VALUE FOR A 
POINTER TO A 
JNCTTON2, 



Yes _>S3605 



No 



1 



DETECTED SOURCE CODE CONVERTED 
TO AN ADDITION OR SUBTOACTION 1 

instruction code that uses the 
Second pc calculator 



Pot 

CO 



i 



S36Q6 

OTHER INTERMEDIATE CODE 
CONVERTED INT O ASSEMBLER COD E J 



FIG. 38 



extern int gl () ; 
extern int gl () ; 
extern int g3 Q ; 
extern int g4 () ; 



f(int i) 
{ 



int (*tp) 0 ; 

switch(i) { 

case 1 : fp — gl ; 

break ; 
case 2 : fp = g2 ; 

break ; 
case 3 : fp = g3 ; 

break ; 
default : fp = g4 

} 

(*fp) 0 ; 



FIG. 39 



tmp = PC 
i ! = 1 
br LI 

fp = (gl — f) -I- tmp 
jmp L 

i ! = 2 

br L2 

fp = (g2 - f) + tmp 
jmp L 

i ! =3 
br L3 

= (g3 - 1) + tmp 
jmp L 

fp = (g4 - f) + tmp 
* (i) 



3201 

3202 

3203 

3204 

3205 

3206 

3207 

3208 

3209 

3210 

3211 

3212 

3213 

3214 

3215 



FIG. 40 



f : 


mov 


PC, rl 


3201 




compne 


l,rO 


3202 




br 


LI 


3203 




addpc 


gl-f,rl 


3204 




jmp 


L 


3205 


LI : 


cmpne 


2,r0 


3206 




br 


L2 


3207 




addpc 


g2-f,rl 


3208 




jmp 


L 


3209 


L2: 


cmpne 


3,rO 


3210 




br 


L3 


3211 




addpc 


g3-f,rl 


3212 




jmp 


L 


3213 


L3 : 


addpc 


g4— f,rl 


3214 


L: 


jsr 


(rl) 


3215 




ret 




3216 



Ha 42 



START 



S4100 



INPUT OF AN INSTRUCTION 
ADDRESS 



] 



S4101 



PACKET ADDRESS SPECIFIED, 
INSTRUCTION PACKET STORED 
IN THE INSTRUCTION BUFFER 



>S4102 



IN-PACKET ADDRESS SPECIFIED, 
INSTRUCTION TAKEN 



I 




-S4109 



ASSEMBLER 

INSTRUCTION 

DISPLAYED 



SUBTRACTION OF LOWER 1 
ADDRESSES 1 




t _.S4105 


SUBTRACTION OF HIGHER 1 
ADDRESSES | 


> 


f ^S4106 


SETTING OF BRANCH 1 
DESTINATION ADDRESS | 


> 


t ^S4107 


LABEL NAME FOUND IN LABEL 1 
TABLE 1 


> 


f _>S4108 


ASSEMBLER INS 
LABEL NAMED] 


iTRUCTION ANDl 
[SPLAYED 1 


^ 





FIG. 43 




-S4200 



INSTRUCTION ADDRESS AND I 
REPLACEMENT INST R UCTION INPUTTED | 



.S4201 



PACKET ADDRESS SPECIFIED, INSTRUCTION 
PACKET STORED IN INSTRUCTION BUFFER 

4> ^S4202 
IN-PACffET ADDRESS SPECIFIED| 



I 



S4203 




Yes 



,S4204 




INSTRUCTION IN FIRST 
UNIT REPLACED WITH 
REPLACEMENT 
INSTRUCTION 



•S42G6 



mSTRUCTION IN THIRD UNIT 
REPLACED WITH REPLACEMENT! 
INSTRUCTION 



INSTRUCTION IN 
SECOND UNIT 
REPLACED WITH 
REPLACEMENT 
INSTRUCTION 



I 



S4208 



INSTRUCTION PACKET IN THE INSTRUCTION 
MEMORY REPLACED WITH INSTRUCTION 
PACKER IN THE INSTRUCTION BUFFER 



C END } 



JOSEPH W PRTCE 
ALBIN H. GESS 
FRANKLIN D. UBELL 
DOYLE B JOHNSON 
MICHAEL J. MOFFATT 
GORDON E GRAY III 
BRADLEY D BLANCHE 



Price, Gess & Ubell 

ATTORNEYS AT LAW 

2100 S.E, MAIN STREET, SUITE 250 
IRVINE, CALIFORNIA 92614-6238 



A PROFESSIONAL CORPORATION 
TELEPHONE- (949) 26 1-8433 
FACSIMILE. (949) 261-9072 
FACSIMILE- (949)261-1726 

e-mail: pgu@pgulaw.com 



DECLARATION AND POWER OF ATTORNEY 
FOR PATENT APPLICATION 
(unexecuted) 



Inventor(s): 



Shuichi Takayama et al. 



Title: 



A PROCESSOR FOR EXECUTING INSTRUCTIONS IN 
UNITS THAT ARE UNRELATED TO THE UNITS IN WHICH 
INSTRUCTIONS ARE READ, AND A COMPILER, AN 
OPTIMIZATION APPARATUS, AN ASSEMBLER, A LINKER, 
A DEBUGGER AND A DISASSEMBLER FOR SUCH PROCESSOR 



Attorney's 
Docket No. 



NAK1-BG86 



EXPRESS MAIL LABEL NO. EL180744630US 



DATE OF DEPOSIT: April 28, 1999 



ATTORNEY DOCKET NO. 
NAK1-BG86 



COMBINED DECLARATION AND POWER OF ATTORNEY 
IN ORIGINAL APPLICATION 

As a below named inventor, I hereby declare that my residence, post office address and citizenship are as stated below next to my name; that I veriiy believe 
that I am the original, first and sole inventor (if only one name is listed below) or joint inventor (if plural inventors are named below) of the subject matter which 
is claimed and for which a patent is sought, entitled: 

A PROCESSOR FOR EXECUTING INSTRUCTIONS IN UNITS THAT ARE UNRELATED TO THE UNITS IN WH ICH 
INSTRUCTIONS ARE READ AND A COMPILER, AN OPTIMIZATION APPARATUS > AN ASSEMBLER, A LINKER , 
A DEBUGGER AND A DISASSEMBLER FOR SUCH PROCESSOR ___ 

the specification of which: 

(check one) Qt is attached hereto. 

□ was filed on as 



Application Serial No, m 
and was amended on 



; Q (if applicable) 

;gf hereby state that I have reviewed and understand the contents of the above-identified specification, including the claims, as amended by any amendment 
rfeferred to above, 

jsj acknowledge the duty to disclose to the Office all information known to me to be material to patentability as defined in Title 37, Code of Federal 
^gulations, § 1^6. 

H hereby claim foreign priority benefits under Title 35, United States Code, § 1 19 of any foreign application® for patent or inventor's certificate listed below 
an3 have also identified below any foreign application for patent or inventor's certificate having a filing date before that of the application on which priority is 
$ajmea\ 

Storeign application®, if any, for Patent or Inventor's Certificate Filed Within 12 Months Prior to the Filing Date of This Application: 

Iji Country Application No. Date of Filing Priority Claimed 

: 2 (day, month, year) Under 35 U.S,C 119 

; g JAPAN Jt(Ktt8325 April 28, 1998 



Yes x 


No 


Yes 


No 


Yes 


No 


Yes 


No 


Yes 


No 


Yes 


No 



All Foreign Applications, if any, for Patent or Inventor's Certificate FHed More Than 12 Months Prior to the Piling Date of This Application; 
Country Application No, Date of Filing 



: I hereby claim the benefit under Title 35, United States Code, § 120 of any United States application® listed below and, insofar as the subject matter of each 
of the claims of this application is not disclosed in the prior United States application in a manner provided by the first paragraph of Title 35, United States 
Code, § 1 12, 1 acknowledge the duty to disclose to the Office all information known to me to be material to patentability as defined in Title 37, Code of Federal 
Regulations, § 1.56 which occurred between the filing date of &e prior application and the national or PCT international filing date of this application: 



(Application Serial No.) (Filing Date) (Scams) 

(patented, pending, abandoned) 



(Application Serial No.) 



(Filing Date) 



(Status) 

(patented, pending, abandoned) 



I hereby appoint the following attorney(s) and/or agent(s) to prosecute this application and to transact all business in the Patent and Trademark Office connected 
therewith: 

Joseph W. Price (Reg. 25,124) 
Franklin D. Ubell (Reg. 27,009) 
Albin H. Gess (Reg. 25,726) 
Gordon E. Gray, III (Reg. 42,602) 

Send Correspondence To: 

Price, Gess & Ubell 

2100 S.E . Main Street, Suite 250 
Irvine, California 92614 
Telephone: 949/261-8433 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are believed to be true; and 
further, that these statements were made with the knowledge that willful false statements and the like are punishable by fine or imprisonment or both (18 U.S.C. 
1001) and may jeopardize the validity of the application or any patent issued thereon. 



FULL NAME OF SOLE OR FIRST INVENTOR 

Shuichi TAKAYAMA 


INVENTOR'S SIGNATURE 


DATE 


IgSIDENCE 

tS-22-6 Nakayamadai , Takarazuka-shi, Hyogo-ken 665-0876 JAPAN 


CITIZENSHIP 

Japanese 1 


sPJDST OFFICE ADDRESS 

Same as Above 


jEULL NAME OF SECOND JOINT INVENTOR, (F ANY 

ipajime OGAWA 


INVENTOR'S SIGNATURE 


DATE 


j'JpSIDENCE 

: p22-7 Korinishinocho, Neyagawa-shi Osaka-£u 572-0089 JAPAN 


CITIZENSHIP 

Japanese 


^POST OFFICE ADDRESS 

QSame as Above 


JFJJLL NAME OF THIRD JOINT INVENTOR, IF ANY 

j^Kenichi KAWAGUCHI 


INVENTOR'S SIGNATURE 


DATE 


^^3^-1-208 Uozakiminamimaclu, Higashinada^ku* Kobe-shi 
iflHvoso-ken 658^0025 JAPAN* 


CITIZENSHIP 
JaDanese 


POST OFFICE ADDRESS 

Same As Above 


FULL NAME OF FOURTH JOINT INVENTOR, IF ANY 

Nobuo HIGAKI 


INVENTOR'S SIGNATURE 


DATE 


R ^?ining-Sato 2H, 4^15^26, Komatsu, Higashiyodogawa^ku 
Osaka -shi. Osaka-fu 533-0004 JAPAN* 


CITIZENSHIP 

Japanese 


POST OFFICE ADDRESS 

Same As Above 


FULL NAME OF FIFTH JOINT INVENTOR, IF ANY 

Kensuke ODANI 


INVENTOR'S SIGNATURE 


DATE 


R ^ DE S&imotorida-cho 3 Murasakino, Kita^-ku, Kyoto^shi, Kyoto-fu 
603-8175 JAPAN 


CITIZENSHIP 

Japanese 


POST OFFICE ADDRESS 

Same As Above 


FULL NAME OF SIXTH JOINT INVENTOR, IF ANY 

Tetsuya TANAKA 


INVENTOR'S SIGNATURE 


DATE 


RESIDENCE 

2-4-1-302 Kuragakiuchi, IbarafcL^shi, Osaka-fu 567-0878 JAPAN 


CITIZENSHIP 

Japanese 


POST OFFICE ADDRESS 

Same as Above 



Doyle B. Johnson (Reg. 39,240) 
Michael J. Moffatt (Reg. 39,304) 
Bradley D. Blanche (Reg. 38,387) 



f 

/ I hereby appoint the following attorney® and/or agent(s) to prosecute this application and to transact all business in the Patent and Trademark Office connected 
therewith: 

Joseph W. Price (Reg. 25, 124) Doyle B. Johnson (Reg. 39,240) 

Franklin D. Ubell (Reg. 27,009) Michael J. Moffatt (Reg. 39,304) 

Albin H. Gess (Reg. 25,726) Bradley D. Blanche (Reg. 38,387) 

Gordon E. Gray, HI (Reg. 42,602) 

Send Correspondence To: 

Price, Gess & Ubell 

2100 S.E. Main Street, Suite 250 
Irvine, California 92614 
Telephone: 949/261-8433 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are believed to be true; and 
further, that these statements were made with the knowledge mat willful false statements and lie like are punishable by fine or imprisonment or both (18 U.S.C. 
1001) and may jeopardize the validity of the application or any patent issued thereon. 



full name OF SEVENTH JOINT INVENTOR 
Shinya MIYAJI 


INVENTOR'S SIGNATURE 


DATE 


4 R ^-5-T65, Yamadaikehigashimachi, Hirakata-shi Osaka-fu 

O 573-0165 JAPAN 


CITIZENSHIP 

Japanese 


PQff OFFICE ADDRESS 

Sajff As Above 


Fug.NAMEOF EIGHTH /JOINT INVENTOR 
Tafeto HEISHI 


INVENTOR'S SIGNATURE 


DATE 


l R -^^ N -5oi, Higashiawaji, Higashiyodogawa-ku, Osaka-shi, 
Osaka-fu 571-8501 JAPAN 


CITIZENSHIP 

Japanese 


POft OFFICE ADDRESS 

Sajnp As Above 


FUg£ NAME OF JOINT INVENTOR, IF ANY 

\% i 


INVENTOR'S SIGNATURE 


DATE 


R^PENCE 


CITIZENSHIP 


PQ|T OFFICE ADDRESS 


FULL NAME OF ' JOINT INVENTOR, IF ANY 


INVENTOR'S SIGNATURE 


DATE 


RESIDENCE 


CITIZENSHIP 


POST OFFICE ADDRESS 


FULL NAME OF iOINT INVENTOR, IF ANY 


INVENTOR'S SIGNATURE 


DATE 


RESIDENCE 


CITIZENSHIP 


POST OFFICE ADDRESS 


FULL NAME OF JOINT INVENTOR, IF ANY 


INVENTOR'S SIGNATURE 


DATE 


RESIDENCE 


CITIZENSHIP 




POST OFFICE ADDRESS 



